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CIS-PRENYLTRANSFERASES FROM PLANTS 
FIELD OF THE INVENTION 
This invention is in the field of plant molecular biology. This invention pertains to 
5 nucleic acid fragments from plants encoding proteins that are homologs of the undecaprenyl 
diphosphate and dehydrodolichyl diphosphate synthases (cw-prenyltransferases) previously 
identified only in microbes. More specifically, this invention pertains to homologs from 
wheat, grape, soybean, rice, African daisy, rubber tree and pot marigold, 

BACKGROUND OF THE INVENTION 
10 Plants synthesize a variety of hydrocarbons built up of isoprene units (C 5 H 8 ), termed 

polyisoprenoids (Tanaka, Y. In Rubber and Related Polyprenols. Methods in Plant 
Biochemistry,, Dey, P. M. and Harborne, J. B., Eds., Academic Press: San Diego, 1991 ; 
Vol. 7, pp 519-536). Those with from 45 to 115 carbon atoms, and varying numbers of cis- 
and trans- (Z- and E-) double bonds, are termed polyprenols, while those of longer chain 
15 length are termed rubbers (Tanaka, Y. In Minor Classes of Terpenoids. Methods in Plant 
Biochemistry; Dey, P. M. and Harborne, J. B., Eds., Academic Press: San Diego, 1991; 
Vol. 7, pp 537-542). The synthesis of these compounds is carried out by a family of 
enzymes termed prenyltransferases, which catalyze the sequential addition of C5 units to an 
initiator molecule. 

20 The initiator molecules themselves are derived from isoprene units through the action 

of distinct prenyltransferases, and are allylic terpenoid diphosphates such as 

dimethylallyldiphosphate (DMAPP), but more usually the C 10 compound geranyl v 

diphosphate (GPP), the C 15 compound famesyl diphosphate (FPP) or the C20 compound 

geranylgeranyl diphosphate (GGPP). Genes encoding the enzymes which synthesize these 

25 allylic terpenoid diphosphates have been cloned from a number of organisms, including 

§H plants, and all of these genes encode polypeptides with conserved regions of homology 

(McGarvey et al., Plant Cell 7:1015-1026 (1995); Chappell, J, y Annu. Rev. Plant Physiol. 

HI Plant Mol Biol. 46:521-547 (1995)). AH of these gene products condense isoprene units i 

j the trans- configuration. Prenyltransferases which condense isoprene units in a cis- 

30 configuration have not been identified in higher animals or plants, nor have 

prenyltransferases catalyzing extension of the polyisoprenoid chain beyond the C20 

compound geranylgeranyl diphosphate. 

A gene encoding octaprenyl diphosphate (OPP) synthase from the bacterium E. coli 

was identified (Asai et al., Biochem. Biophys. Res. Commun. 202:340-345 (1994)), and more. 

35 recently, genes encoding bacterial undecaprenyl diphosphate (UPP) synthases (Shimizu 

etal.,7. Biol Chem. 273:19476-19481 (1998); Apfel et al., J. Bacteriol 181:483-492 

(1999)) and yeast dehydrodolichyl diphosphate (DedoI-PP) synthase (Sato et al., Mol Cell. 

Biol 19:471-483 (1999)) were identified. OPP synthase generates the ail-trans 
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polyisoprenoid side chain of biological quinones (ubiquinone-8, menaquinone-8 and 
dimethylmenaquinone-8), and its primary structure contains regions of similarity with GPP, 
FPP and GGPP synthases. UPP synthase and Dedol-PP synthase generate cis- 
polyisoprenoids, and their primary structures are related to each other but distinct from those 
5 of OPP, GPP, FPP and GGPP synthases. 

There are several suggested functions for plant polyisoprenoids. Terpenoid quinones 
are most likely involved in photophosphorylation and respiratory chain phosphorylation. 
Rubbers have been implicated in plant defense against herbivory, possibly serving to repel 
and entrap insects and seal wounds in a manner analogous to plant resins. The specific roles 
10 of the C45-C 1 1 5 polyprenols remain unidentified, although as with most secondary 

metabolites they too most likely function in plant defense. Short-chain polyprenols may also 
be involved in protein glycosylation in plants, by analogy with the role of dolichols in animal 
metabolism. 

The problem to be solved is to identify new plant genes having utility in plant 

1 5 defense mechanisms. Applicants have solved the stated problem by the identification of 

plant genes encoding plant czs-prenyltransferases. The present invention presents genes with 
significant homology to the bacterial UPP synthase and yeast Dedol-PP synthase from 
plants. The present invention shows that such genes are present in a range of plant species, 
including economically important crop plants such as cereals and the rubber tree Hevea 

20 brasiliensis, and thus are likely to be ubiquitous in plants. 

This invention pertains to the identification and characterization of EST sequences 
from wheat, grape, soybean, rice, African daisy, rubber tree and pot marigold encoding cis- 
prenyltransferase proteins from these species. 

SUMMARY OF THE INVENTION 

25 It is an object of the present invention to provide an isolated nucleic acid fragment 

encoding a plant cw-prenyltransferase protein selected from the group consisting of: (a) an 
isolated nucleic acid fragment encoding all or a substantial portion of the amino acid 
sequence selected from the group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID 
NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16, 

30 SEQ ID NO: 1 8 and SEQ ID NO:20; (b) an isolated nucleic acid fragment that is 

substantially similar to an isolated nucleic acid fragment encoding all or a substantial 
portion of the amino acid sequence selected from the group consisting of SEQ ID NO:2, 
SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID 
NO: 14, SEQ ID NO: 16, SEQ ID NO: 18 and SEQ ID NO:20; (c) an isolated nucleic acid 

3 5 fragment encoding a polypeptide, the polypeptide having at least 4 1 % identity with the 
amino acid sequence set forth in SEQ ID NO:24 (d) an isolated nucleic acid fragment 
encoding having at least 50% identity with nucleic acid sequence as set forth in SEQ ID 
NO:23; (e) an isolated nucleic acid molecule that hybridizes with a nucleic acid sequence of 

2 
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(a) (b), (c) or (d) under the following hybridization conditions: 0.1X SSC, 0.1% SDS, 65 °C 
and washed with 0.2X SSC, 0.5% SDS;; (f) an isolated nucleic acid fragment that hybridizes 
with a nucleic acid sequence selected from the group consisting of SEQ ID NO:l, SEQ ID 
NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:l 1, SEQ ID NO: 13, SEQ 
5 ID NO: 15, SEQ ID NO: 17 and SEQ ID NO: 19 under the following hybridization 

conditionsO.lX SSC, 0.1% SDS, 65 °C and washed with 0.2X SSC, 0.5% SDS; and (g) an 
isolated nucleic acid fragment that is complementary to (a), (b), (c), (d), (e) or (f). 

The invention further provides polypeptides encoded by the isolated nucleic acid 
fragments of the present invention, such as are presented in SEQ ID NO:2, SEQ ID NO:4, 
10 SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID 
NO: 16, SEQ ID NO: 18 and SEQ ID NO:20. 

In another embodiment the invention provides a chimeric gene comprising the 
isolated nucleic acid fragment of the present invention operably linked to suitable regulatory 
sequences. 

1 5 The invention additionally provides a method of altering the level of expression of a 

plant m-prenyltransferase protein in a host cell comprising: (a) transforming a host cell 
with the chimeric gene of the present invention and; (b) growing the transformed host cell 
produced in step (a) under conditions that are suitable for expression of the chimeric gene 
resulting in production of altered levels of a plant c/s-prenyltransferase protein in the 

* 

20 transformed host cell relative to expression levels of an untransformed host cell. The 
invention further provides that where the cis-prenyltransferase protein is expressed in a 
transformed plant that the defense mechanism of the plant will be modulated. 

The invention additionally provides transformed host cells comprising the chimeric 
- genes of the present invention. . 

25 In an alternative embodiment the invention provides methods of obtaining a nucleic 

i 

jj acid fragment encoding all or a substantial portion of the amino acid sequence encoding a 

\ plant cw-prenyltransferase protein using portions of the present nucleic acid sequences as 

i hybridization probes or as primers. 

; BRIEF DESCRIPTION OF THE DRAWINGS _ 

30 AND SEQUENCE DESCRIPTIONS 

Figure 1 shows a scheme for synthesis of GPP, FPP and GGPP from IPP and the 
synthesis of polyprenols from GPP, FPP and GGPP. 

Figure 2 shows an alignment of coding regions of cDNAs encoding homologs of 
bacterial undecaprenyl phosphate synthases from different plant species with those of a 
35 bacterial (Micrococcus luteus) and two yeast (rer2 y srtl) genes. 

Figure 3 shows an alignment of the deduced amino acid sequences of plant cis- 
prenyltransferases. 
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Figure 4 shows an alignment of the proteins derived from the partial plant cDNAs 
shown in Figure 2, with the deduced amino acid sequences of a bacterial {Micrococcus 
luteus) and two yeast (rer2, srtl) genes. 

Figure 5 A depicts the chromatogram (diode array detector response at 210nm) 
5 generated by LC-MS analysis of non-saponifiable material extracted from wild-type 
arabidopsis leaves. 

Figure 5 B depicts the chromatogram (diode array detector response at 210nm) 
generated by LC-MS analysis of non-saponifiable material extracted from leaves of 
arabidopsis transformed with a 35S::Hpt3 construct. 
1 0 Figure 5 C depicts the chromatogram (diode array detector response at 2 1 Onm) 

generated by LC-MS analysis of non-saponifiable material extracted from leaves of 
arabidopsis transformed with a 35S::rrl construct. 

Figure 5 D depicts the chromatogram (diode array detector response at 21 Onm) 
generated by LC-MS analysis of non-saponifiable material extracted from leaves of 
15 arabidopsis transformed with a 3 5S::Apt5 construct. 

Figure 5 E depicts the chromatogram (diode array detector response at 21 Onm) 
generated by LC-MS analysis of non-saponifiable material extracted from leaves of 
arabidopsis transformed with a 35S::S11 construct. 

Figure 6A depicts the extracted ion chromatogram for dodecaprenol (mass detector 
20 response to ions with m/z 8 16 to 8 1 8) generated by LC-MS analysis of non-saponifiable 
material extracted from wild-type arabidopsis leaves. 

Figure 6B depicts the extracted ion chromatogram for dodecaprenol (mass detector 
response to ions with m/z 816 to 818) generated by LC-MS analysis of non-saponifiable 
material extracted from leaves of arabidopsis transformed with a 35S ::Hpt3 construct. 
25 Figure 6C depicts the extracted ion chromatogram for dodecaprenol (mass detector 

response to ions with m/z 816 to 818) generated by LC-MS analysis of non-saponifiable 
material extracted from leaves of arabidopsis transformed with a 35S::rrl construct. 

Figure 6D depicts the extracted ion chromatogram for dodecaprenol (mass detector 
response to ions with m/z 816 to 818) generated by LC-MS analysis of non-saponifiable 
30 material extracted from leaves of arabidopsis transformed with a 35S::Apt5 construct. 

Figure 6E depicts the extracted ion chromatogram for dodecaprenol (mass detector 
response to ions with m/z 816 to 818) generated by LC-MS analysis of non-saponifiable 
material extracted from leaves of arabidopsis transformed with a 35S::SU construct. 

The invention can be more fully understood from the following detailed description 
35 and the accompanying sequence descriptions which form part of this application. 

The following sequence descriptions and sequences listings attached hereto comply 
with the rules governing nucleotide and/or amino acid sequence disclosures in patent 
applications as set forth in 37 C.F.R. §1.821-1.825 ("Requirements for Patent Applications 
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Containing Nucleotide Sequences and/or Amino Acid Sequence Disclosures - the Sequence 
Rules") and are consistent with World Intellectual Property Organization (WIPO) Standard 
ST2.5 (1998) and the sequence listing requirements of the EPO and PCT (Rules 5.2 and 
49.5(a-bis), and Section 208 and Annex C of the Administration Instructions). The 
5 Sequence Descriptions contain the one letter code for nucleotide sequence characters and the 
three letter codes for amino acids as defined in conformity with the IUPAC-IYUB standards 
described in Nucleic Acids Res. 13:3021-3030 (1985) and in the Biochemical Journal 
219:345-373 (1984) which are herein incorporated by reference. 

SEQ ID NO: 1 is the nucleotide sequence for the African daisy clone 
10 dms2c.pk005.c7. 

SEQ ID NO:2 is the deduced amino acid sequence for the African daisy 
dms2c.pk005.c7, encoded by SEQ ID NO:l. 

SEQ ID NO:3 is the nucleotide sequence for the Pot Marigold clone 
ecslc.pk009.pl 9. 

1 5 SEQ ID NO: 4 is the deduced amino acid sequence for the Pot Marigold clone 

ecslc.pk009.pl 9, encoded by SEQ ID NO:3. 

SEQ ID NO:5 is the nucleotide sequence for the Hevea clone ehb2c.pk001.il0. 

SEQ ID NO:6 is the deduced amino acid sequence for the Hevea clone 
ehb2c.pk001.il0, encoded by SEQ ID NO:5. 
20 SEQ ID NO:7 is the nucleotide sequence for the Hevea clone ehb2c.pk00 1 .d 1 7. 

SEQ ID NO: 8 is the deduced amino acid sequence for the Hevea clone 
ehb2c.pk001.dl7, encoded by SEQ ID NO:7. 

SEQ ID NO: 9 is the nucleotide sequence for the Hevea clone ehb2c.pk001.ol8. 

SEQ ID NO: 10 is the deduced amino acid sequence for the Hevea clone 
25 ehb2c.pk00 1 .o 1 8, encoded by SEQ ID NO:9. 

SEQ ID NO:l 1 is the nucleotide sequence for the grape clone vdblc.pk001.k23 . 

SEQ ID NO: 12 is the deduced amino acid sequence for the grape clone 
vdblc.pk001.k23, encoded by SEQ ID NO:l 1. 

SEQ ID NO: 13 is the nucleotide sequence for the rice clone rI0n.pkl 17.i23. 
30 SEQ ID NO: 14 is the deduced amino acid sequence for the rice clone rlOn.pkl 17.i23, 

encoded by SEQ ID NO: 13. 

SEQ ID NO: 15: is the nucleotide sequence for clone the rice clone rrl.pk0050.h8. 

SEQ ID NO: 16 is the deduced amino acid sequence for rrl ,pk0050.h8, encoded by 
SEQ ID NO: 15. 

35 SEQ ID NO: 1 7 is the nucleotide sequence for the soybean clone si 1 .pk0128.h7. 

SEQ ID NO: 18 is the deduced amino acid sequence for the soybean clone 

sll .pk0128.h7, encoded by SEQ ID NO:17. 

SEQ ID NO: 19 is the nucleotide sequence for the wheat clone wdk5c.pk005.f22. 

5 
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SEQ ID NO:20 is the deduced amino acid sequence for the wheat clone 
wdk5c.pk005.f22, encoded by SEQ ID NO: 19. 

SEQ ID NO:21 is the conserved Domain I. 

SEQ ID NO:22 is the conserved Domain V. 
5 SEQ ID NO:23 is the nucleotide sequence encoding a bacterial undecaprenyl 

phosphate synthase isolated from Micrococcus luteus. 

SEQ ID NO:24 is the deduced amino acid sequence of a bacterial undecaprenyl 
phosphate synthase isolated from Micrococcus luteus. 

SEQ ID NO:25 is the nucleotide sequence encoding a yeast undecaprenyl phosphate 
1 0 synthase isolated from the yeast strain rer2, 

SEQ ID NO:26 is the deduced amino acid sequence of a yeast undecaprenyl 
phosphate synthase isolated from the yeast strain rerl. 

SEQ ID NO:27 is the nucleotide sequence encoding a yeast undecaprenyl phosphate 
synthase isolated from the yeast strain srtl. 
1 5 SEQ ID NO:28 is the deduced amino acid sequence of a yeast undecaprenyl 

phosphate synthase isolated from the yeast strain srtl. 

SEQ ID NO's 29 -36 are primers used for the transformation of arabidopsis with 
various c/s-prenyltransferases genes. 

SEQ ID NO:37 is the nucleotide sequence of the Apt5 arabidopsis cw-prenyl 
20 transferase homolog. 

DETAILED DESCRIPTION OF THE INVENTION 
The present invention reports the isolation and characterization of cDNAs 
corresponding to genes homologous with microbial m-prenyltransferases as ESTs from 
wheat, grape, soybean, rice, African daisy, rubber and marigold. No such homologs have 

25 been described previously in these species. The level of expression of the genes described 
here can be altered in the plant by methods of cosuppression and overexpression. As they 
are previously undescribed genes involved in synthesizing a family of molecules with 
fundamental cellular roles as well as roles in plant defense, this can lead to novel phenotypes 
that are expected to be beneficial for crop protection, production or as industrial sources of 

30 polyisoprenoids. In addition, if the reduction in expression of one of the genes leads to a 
growth or developmental defect in the plant, this gene can be used as a novel herbicide 
target. All isolated proteins can be used as tools to study the elaboration of polymeric cis- 
isoprenoids by plants. This can lead to the identification of additional proteins that can be 
used as described above. Any related EST sequences can be directly used for the above 

35 described applications in crop plants. 

The following definitions are provided for the full understanding of terms and 

abbreviations used in this specification: 

"Polymerase chain reaction" is abbreviated PCR 

6 
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"Expressed sequence tag" is abbreviated EST 
"Open reading frame" is abbreviated ORF 

"SDS polyacrylamide gel electrophoresis" is abbreviated SDS-PAGE 

"UPPS" is the abbreviation for the specific undecaprenyl diphosphate synthases 
isolated from bacteria. 

"OPPS" is the abbreviation for the specific octaprenyl diphosphate synthases isolated 
from bacteria. 

"Dedol-PP" is dehydrodolichol diphosphate 

"DMAPP" is dimethyl allyl diphosphate 

"IPP" is isopentenyl diphosphate 

"GPP" is geranyl diphosphate 

"FPP" is farnesyl diphosphate 

"GGPP" is geranylgeranyl diphosphate 

The term "c/s-prenyltransferase" refers generally to a class of enzymes capable of 
catalyzing the sequential addition of C5 units to polyprenols and rubbers. Two examples of 
cis-prenyltransferases are the undecaprenyl diphosphate and dehydrodolichyl diphosphate 
synthases. 

The terms "isolated nucleic acid fragment" or "isolated nucleic acid molecule" refer 
to a polymer of RNA or DNA that is single- or double-stranded, optionally containing 
synthetic, non-natural or altered nucleotide bases. An isolated nucleic acid fragment or an 
isolated nucleic acid molecule in the form of a polymer of DNA may be comprised of one or 
more segments of cDNA, genomic DNA, or synthetic DNA. 

The terms "host cell" and "host organism" refer to a cell capable of receiving foreign 
. or heterologous genes and expressing those genes to produce an active gene product. - 
Suitable host cells include microorganisms such as bacteria and fungi, as well as plant cells. 

The term "plant defense response" refers to the ability of a plant to deter tissue 
damage by insects, pathogens such as fungi, bacteria or viruses, as well as herbivores. 

The term "fragment" refers to a DNA or amino acid sequence comprising a 
subsequence of the nucleic acid sequence or protein of the present invention. However, an 
active fragment of the present invention comprises a sufficient portion of the protein to 
maintain activity. 

As used herein, "substantially similar" refers to nucleic acid fragments wherein 

changes in one or more nucleotide bases result in substitution of one or more amino acids, 

but do not affect the functional properties of the protein encoded by the DNA sequence. 

" Substantially similar" also refers to nucleic acid fragments wherein changes in one or more 

nucleotide bases do not affect the ability of the nucleic acid fragment to mediate alteration of 

gene expression by antisense or co-suppression technology. " Substantially similar" also 

refers to modifications of the nucleic acid fragments of the instant invention such as deletion 

7 
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or insertion of one or more nucleotide bases that do not substantially affect the functional 
properties of the resulting transcript vis-a-vis the ability to mediate alteration of gene 
expression by antisense or co-suppression technology or alteration of the functional 
properties of the resulting protein molecule. It is therefore understood that the invention 
5 encompasses more than the specific exemplary sequences. 

For example, it is well known in the art that antisense suppression and co-suppression 
of gene expression may be accomplished using nucleic acid fragments representing less that 
the entire coding region of a gene, and by nucleic acid fragments that do not share 100% 
identity with the gene to be suppressed. Moreover, alterations in a gene which result in the 
1 0 production of a chemically equivalent amino acid at a given site, but do not effect the 

functional properties of the encoded protein, are well known in the art. Thus, a codon for the 
amino acid alanine, a hydrophobic amino acid, may be substituted by a codon encoding 
another less hydrophobic residue (such as glycine) or a more hydrophobic residue (such as 
valine, leucine, or isoleucine). Similarly, changes which result in substitution of one 
1 5 negatively charged residue for another (such as aspartic acid for glutamic acid) or one 

positively charged residue for another (such as lysine for arginine) can also be expected to 
produce a functionally equivalent product Nucleotide changes which result in alteration of 
the N-terminal and C-terminal portions of the protein molecule would also not be expected 
to alter the activity of the protein. Each of the proposed modifications is well within the 
20 routine skill in the art, as is determination of retention of biological activity of the encoded 
products. Moreover, the skilled artisan recognizes that substantially similar sequences 
encompassed by this invention are also defined by their ability to hybridize, under stringent 
conditions (0.1X SSC, 0.1% SDS, 65 °C), with the sequences exemplified herein. Preferred 
substantially similar nucleic acid fragments of the instant invention are those nucleic acid 
25 fragments whose DNA sequences are at least 80% identical to the DNA sequence of the 
nucleic acid fragments reported herein. More preferred nucleic acid fragments are at least 
W\ 90% identical to the identical to the DNA sequence of the nucleic acid fragments reported 

ijj herein. Most preferred are nucleic acid fragments that are at least 95% identical to the DNA 

sequence of the nucleic acid fragments reported herein. 
30 A " substantial portion" of an amino acid or nucleotide sequence comprising enough 

of the amino acid sequence of a polypeptide or the nucleotide sequence of a gene to 
a?fj putatively identify that polypeptide or gene, either by manual evaluation of the sequence by 

^ one skilled in the art, or by computer-automated sequence comparison and identification 

using algorithms such as BLAST (Basic Local Alignment Search Tool; Altschul, S. F., et al., 
35 (1993) J. Mol Biol 215:403-410; see also www.ncbi.nim.nih.gov/BLAST/). In general, a 
sequence often or more contiguous amino acids or thirty or more nucleotides is necessary in 
order to putatively identify a polypeptide or nucleic acid sequence as homologous to a 
known protein or gene. Moreover, with respect to nucleotide sequences, gene specific 
S0j 8 
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oligonucleotide probes comprising 20-30 contiguous nucleotides may be used in sequence- 
dependent methods of gene identification (e.g., Southern hybridization) and isolation (e.g., 
in situ hybridization of bacterial colonies or bacteriophage plaques). In addition, short 
oligonucleotides of 12-15 bases may be used as amplification primers in PCR in order to 
5 obtain a particular nucleic acid fragment comprising the primers. Accordingly, a 
"substantial portion" of a nucleotide sequence comprises enough of the sequence to 
specifically identify and/or isolate a nucleic acid fragment comprising the sequence. The 
instant specification teaches partial or complete amino acid and nucleotide sequences 
encoding one or more particular fungal proteins. The skilled artisan, having the benefit of 

10 the sequences as reported herein, may now use all or a substantial portion of the disclosed 
sequences for purposes known to those skilled in this art. Accordingly, the instant invention 
comprises the complete sequences as reported in the accompanying Sequence Listing, as 
well as substantial portions of those sequences as defined above. 

The term "sequence analysis software" refers to any computer algorithm or software 

1 5 program that is useful for the analysis of nucleotide or amino acid sequences. " Sequence 
analysis software" may be commercially available or independently developed. Typical 
sequence analysis software will include but is not limited to the GCG suite of programs 
(Wisconsin Package Version 9.0, Genetics Computer Group (GCG), Madison, WI), 
BLASTP, BLASTN, BLASTX (Altschul et al, J. Mol Biol 215:403-410 (1990), Vector 

20 NTI (InforMax Inc. 6 1 1 0 Executive Boulevard, Suite 400, North Bethesda, MD) and 

DNASTAR (DNASTAR Inc. 1228 S. Park Street, Madison, WI). Within the context of this 
application it will be understood that where sequence analysis software is used for analysis, 
that the results of the analysis will be based on the "default values" of the program 
referenced, unless otherwise specified. ; As used herein " default vales* 1 will mean any set of 

25 values or parameters which originally load with the software when first initialized. 

The term "percent identity" , as known in the art, is a relationship between two or more 
polypeptide sequences or two or more polynucleotide sequences, as determined by 
comparing the sequences. In the art, " identity" also means the degree of sequence 
relatedness between polypeptide or polynucleotide sequences, as the case may be, as 

30 determined by the match between strings of such sequences. " Identity" and " similarity" 
can be readily calculated by known methods, including but not limited to those described in: 
Computational Molecular Biology (Lesk, A. M., ed.) Oxford University Press, New York 
(1988); Biocomputing: Informatics and Genome Projects (Smith, D. W., ed.) Academic 
Press, New York (1993); Computer Analysis of Sequence Data. Part I (Griffin, A. M., and 

35 Griffin, H. G., eds.) Humana Press, New Jersey (1 994); Sequence Analysis in Molecular 
Biology (von Heinje, G., ed.) Academic Press (1987); and Sequence Analysis Primer 
(Gribskov, M. and Devereux, J., eds.) Stockton Press, New York (1991). Preferred methods 
to determine identity are designed to give the best match between the sequences tested. 

9 
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Methods to determine identity and similarity are codified in publicly available computer 
programs. Sequence alignments and percent identity calculations may be performed using 
the Megalign program of the LASERGENE bioinformatics computing suite. (DNASTAR 
Inc., Madison, WI). Multiple alignment of the sequences was performed using the Clustal 
5 method of alignment (Higgins and Sharp (1989) CABIOS. 5:151-153) with the default 

parameters (GAP PENALTY- 10, GAP LENGTH PEN ALT Y= 10). Default parameters for 
pairwise alignments using the Clustal method were KTUPLE 1, GAP PENALTY=3, 
WINDOW=5 and DIAGONALS SAVED=5. 

Suitable nucleic acid fragments (isolated polynucleotides of the present invention) 
10 encode polypeptides that are at least about 70% identical, preferably at least about 80% 
identical to the amino acid sequences reported herein. Preferred nucleic acid fragments 
encode amino acid sequences that are about 85% identical to the amino acid sequences 
reported herein. More preferred nucleic acid fragments encode amino acid sequences that 
are at least about 90% identical to the amino acid sequences reported herein. Most preferred 
15 are nucleic acid fragments that encode amino acid sequences that are at least about 95% 
identical to the amino acid sequences reported herein. Suitable nucleic acid fragments not 
only have the above homologies but typically encode a polypeptide having at least 50 amino 
acids, preferably at least 100 amino acids, more preferably at least 150 amino acids, still 
more preferably at least 200 amino acids, and most preferably at least 250 amino acids. 
20 " Codon degeneracy" refers to divergence in the genetic code permitting variation of 

the nucleotide sequence without effecting the amino acid sequence of an encoded 
polypeptide. Accordingly, the present invention relates to any nucleic acid fragment that 
encodes all or a substantial portion of present proteins as set forth in SEQ ID NO:2, SEQ ID 
NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO: 10, SEQ IDNO:12, SEQ IDNO:14, SEQ 
25 ID NO: 1 6, SEQ ID NO: 1 8 and SEQ ID NO:20. The skilled artisan is well aware of the 
1 " codon-bias" exhibited by a specific host cell to use nucleotide codons to specify a given 

amino acid. Therefore, when synthesizing a gene for improved expression in a host cell, it 
is desirable to design the gene such that its frequency of codon usage approaches the 
frequency of preferred codon usage of the host cell. 
30 The term "complementary" is used to describe the relationship between nucleotide 

bases that are hybridizable to one another. Hence with respect to DNA, adenosine is 
complementary to thymine and cytosine is complementary to guanine. 

A nucleic acid molecule is "hybridizable" to another nucleic acid molecule, such as a 
cDNA, genomic DNA, or RNA, when a single stranded form of the nucleic acid molecule 
35 can anneal to the other nucleic acid molecule under the appropriate conditions of temperature 
and solution tonic strength. Hybridization and washing conditions are well known and 
exemplified in Sambrook, J., Fritsch, E. F. and Maniatis, T. Molecular Cloning: A 
Laboratory Manual Second Edition, Cold Spring Harbor Laboratory Press, Cold Spring 
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Harbor (1989), particularly Chapter 1 1 and Table 11.1 therein (entirely incorporated herein 
by reference). The conditions of temperature and ionic strength determine the "stringency" 
of the hybridization. Stringency conditions can be adjusted to screen for moderately similar 
fragments, such as homologous sequences from distantly related organisms, to highly similar 
5 fragments, such as genes that duplicate functional enzymes from closely related organisms. 
Post-hybridization washes determine stringency conditions. One set of preferred conditions 
uses a series of washes starting with 6X SSC, 0.5% SDS at room temperature for 15 min, 
then repeated with 2X SSC, 0.5% SDS at 45°C for 30 min, and then repeated twice with 
0.2X SSC, 0.5% SDS at 50°C for 30 min. A more preferred set of stringent conditions uses 

10 higher temperatures in which the washes are identical to those above except for the 

temperature of the final two 30 min washes in 0.2X SSC, 0.5% SDS was increased to 60°C. 
Another preferred set of highly stringent conditions uses two final washes in 0.1 X SSC, 
0. 1% SDS at 65°C. Hybridization requires that the two nucleic acids contain complementary 
sequences, although depending on the stringency of the hybridization, mismatches between 

15 bases are possible. The appropriate stringency for hybridizing nucleic acids depends on the 
length of the nucleic acids and the degree of complementation, variables well known in the 
art. The greater the degree of similarity or homology between two nucleotide sequences, the 
greater the value of Tm for hybrids of nucleic acids having those sequences. The relative 
stability (corresponding to higher Tm) of nucleic acid hybridizations decreases in the 

20 following order: RNA:RNA, DNA:RNA, DNA:DNA. For hybrids of greater than 

100 nucleotides in length, equations for calculating Tm have been derived (see Sambrook 
et al., supra, 9.50-9.51). For hybridizations with shorter nucleic acids, i.e., oligonucleotides, 
the position of mismatches becomes more important, and the length of the oligonucleotide 
determines its specificity (see Sambrook et al., supra, 1 1 .7-11.8). In one embodiment the 

25 length for a hybridizable nucleic acid is at least about 10 nucleotides. Preferable a minimum 
length for a hybridizable nucleic acid is at least about 1 5 nucleotides; more preferably at 
least about 20 nucleotides; and most preferably the length is at least 30 nucleotides. 
Furthermore, the skilled artisan will recognize that the temperature and wash solution salt 
concentration may be adjusted as necessary according to factors such as length of the probe. 

30 "Synthetic genes" can be assembled from oligonucleotide building blocks that are 

chemically synthesized using procedures known to those skilled in the art. These building 
blocks are ligated and annealed to form gene segments which are then enzymatically 
assembled to construct the entire gene. "Chemically synthesized", as related to a sequence 
of DNA, means that the component nucleotides were assembled in vitro. Manual chemical 

35 synthesis of DNA may be accomplished using well established procedures, or automated 

chemical synthesis can be performed using one of a number of commercially available 

machines. Accordingly, the genes can be tailored for optimal gene expression based on 

optimization of nucleotide sequence to reflect the codon bias of the host cell. The skilled 
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artisan appreciates the likelihood of successful gene expression if codon usage is biased 
towards those codons favored by the host. Determining preferred codons can be based on a 
survey of genes derived from the host cell where sequence information is available. 

"Gene" refers to a nucleic acid fragment that expresses a specific protein, including 
5 regulatory sequences preceding (5' non-coding sequences) and following (3 1 non-coding 
sequences) the coding sequence. "Native gene" refers to a gene as found in nature with its 
own regulatory sequences. "Chimeric gene" refers to any gene, not a native gene, 
comprising regulatory and coding sequences that are not found together in nature. 
Accordingly, a chimeric gene may comprise regulatory sequences and coding sequences that 

1 0 are derived from different sources, or regulatory sequences and coding sequences derived 
from the same source, but arranged in a manner different than that found in nature. 
"Endogenous gene" refers to a native gene in its natural location in the genome of an 
organism. A "foreign" gene refers to a gene not normally found in the host organism, but 
which is introduced into the host organism by gene transfer. Foreign genes can comprise 

15 native genes inserted into a non-native organism, or chimeric genes. A "transgene" is a gene 
that has been introduced into the genome by a transformation procedure. 

"Coding sequence" refers to a DNA sequence that codes for a specific amino acid 
sequence. "Regulatory sequences" refer to nucleotide sequences located upstream (5* non- 
coding sequences), within, or downstream (3' non-coding sequences) of a coding sequence, 

20 and which influence the transcription, RNA processing or stability, or translation of the 
associated coding sequence. Regulatory sequences may include promoters, translation 
leader sequences, introns and polyadenylation recognition sequences. 

"Promoter" refers to a DNA sequence capable of controlling the expression of a 
coding sequence or functional RNA. In general, a coding sequence is located 3' to a 

25 promoter sequence. The promoter sequence consists of proximal and more distal upstream 
elements, the latter elements often referred to as enhancers. Accordingly, an "enhancer" is a 
DNA sequence which can stimulate promoter activity and may be an innate element of the 
promoter or a heterologous element inserted to enhance the level or tissue-specificity of a 
promoter. Promoters may be derived in their entirety from a native gene, or be composed of 

30 different elements derived from different promoters found in nature, or even comprise 

synthetic DNA segments. It is understood by those skilled in the art that different promoters 
may direct the expression of a gene in different tissues or cell types, or at different stages of 
development, or in response to different environmental conditions. Promoters which cause a 
gene to be expressed in most cell types at most times are commonly referred to as 

35 "constitutive promoters". New promoters of various types useful in plant cells are 
constantly being discovered; numerous examples may be found in the compilation by 
Okamuro and Goldberg, (Biochem. Plants 15:1-82 (1989)). It is further recognized that 
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since in most cases the exact boundaries of regulatory sequences have not been completely 
defined, DNA fragments of different lengths may have identical promoter activity. 

The "translation leader sequence" refers to a DNA sequence located between the 
promoter sequence of a gene and the coding sequence. The translation leader sequence is 
5 present in the fully processed mRNA upstream of the translation start sequence. The 
translation leader sequence may affect processing of the primary transcript to mRNA, 
mRNA stability or translation efficiency. Examples of translation leader sequences have 
been described (Turner et ah, Mol Biotech. 3:225 (1995)). 

The "3* non-coding sequences" refer to DNA sequences located downstream of a 

1 0 coding sequence and include polyadenylation recognition sequences and other sequences 
encoding regulatory signals capable of affecting mRNA processing or gene expression. The 
polyadenylation signal is usually characterized by affecting the addition of polyadenylic acid 
tracts to the 3' end of the mRNA precursor. The use of different 3* non-coding sequences is 
exemplified by Ingelbrecht et al. {Plant Cell 1:671-680 (1989)). 

1 5 "RNA transcript" refers to the product resulting from RNA polymerase-catalyzed 

transcription of a DNA sequence. When the RNA transcript is a perfect complementary 
copy of the DNA sequence, it is referred to as the primary transcript or it may be a RNA 
sequence derived from posttranscriptional processing of the primary transcript and is 
referred to as the mature RNA. "Messenger RNA" (mRNA) refers to the RNA that is 

20 without introns and that can be translated into protein by the cell. "cDNA" refers to a 
double-stranded DNA that is complementary to and derived from mRNA. "Sense" RNA 
refers to RNA transcript that includes the mRNA and so can be translated into protein by the 
cell. "Antisense RNA" refers to a RNA transcript that is complementary to all or part of a 
target primary transcript or mRNA and that blocks the expression of a target gene 

25 (U.S. 5,107,065). The complementarity of an antisense RNA may be with any part of the 
specific gene transcript, i.e., at the 5' non-coding sequence, 3* non-coding sequence, introns 
or the coding sequence. "Functional RNA" refers to antisense RNA, ribozyme RNA or 
other RNA that is not translated yet has an effect on cellular processes. 
The term "operably-linked" refers to the association of nucleic acid sequences on a 

30 single nucleic acid fragment so that the function of one is affected by the other. For 
example, a promoter is operably-linked with a coding sequence when it affects the 
expression of that coding sequence (i.e., that the coding sequence is under the transcriptional 
control of the promoter). Coding sequences can be operably-linked to regulatory sequences 
in sense or antisense orientation. 

35 The term "expression" refers to the transcription and stable accumulation of sense 

(mRNA) or antisense RNA derived from the nucleic acid fragment of the invention. 

Expression may also refer to translation of mRNA into a polypeptide. "Antisense 

inhibition" refers to the production of antisense RNA transcripts capable of suppressing the 
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expression of the target protein. "Overexpression" refers to the production of a gene product 
in transgenic organisms that exceeds levels of production in normal or non-transformed 
organisms. "Co-suppression" refers to the production of sense RNA transcripts capable of 
suppressing the expression of identical or substantially similar foreign or endogenous genes 
5 (U.S. 5,231,020). 

"Altered levels" refers to the production of gene product(s) in organisms in amounts 
or proportions that differ from that of normal or non-transformed organisms. 

"Mature" protein refers to a post-translationally processed polypeptide; i.e., one from 
which any pre- or propeptides present in the primary translation product have been removed. 
1 0 "Precursor" protein refers to the primary product of translation of mRNA; i.e., with pre- and 
propeptides still present. Pre- and propeptides may be but are not limited to intracellular 
localization signals. 

A "chloroplast transit peptide" is an amino acid sequence which is translated in 
conjunction with a protein and directs the protein to the chloroplast or other plastid types 

15 present in the cell in which the protein is made. "Chloroplast transit sequence" refers to a 
nucleotide sequence that encodes a chloroplast transit peptide. A "signal peptide" is an 
amino acid sequence which is translated in conjunction with a protein and directs the protein 
to the secretory system (Chrispeels, J. J., Ann. Rev. Plant Phys. Plant MoL Biol 42:21-53 
(1 991)). If the protein is to be directed to a vacuole, a vacuolar targeting signal {supra) can 

20 further be added, or if to the endoplasmic reticulum, an endoplasmic reticulum retention 
signal (supra) may be added. If the protein is to be directed to the nucleus, any signal 
peptide present should be removed and instead a nuclear localization signal included 
(Raikhel et al., Plant Phys. 100:1627-1632 (1992)). 

'Transformation" refers to the transfer of a nucleic acid fragment into the genome of * 

25 a host organism, resulting in genetically stable inheritance. Host organisms containing the 
transformed nucleic acid fragments are referred to as "transgenic" organisms. Examples of 
methods of plant transformation include Agrobacterium-mediated transformation (De Blaere 
et al., Meth. Enzymol. 143:277 (1987)) and particle-accelerated or "gene gun" transformation 
technology (Klein et al., Nature, London 327:70-73 (1987); U.S. 4,945,050). 

30 Standard recombinant DNA and molecular cloning techniques used herein are well 

known in the art and are described more fully in Sambrook, J., Fritsch, E. F. and Maniatis, T. 
Molecular Cloning: A Laboratory Manual ; Cold Spring Harbor Laboratory Press: Cold 
Spring Harbor, 1 989 (hereinafter "Sambrook et al."). 

Unique plant homologs of microbial m-prenyltransferase proteins, involved in the 

35 synthesis of poly-cw-isoprenoids, have been isolated from wheat, grape, soybean, rice, 
African daisy, rubber and marigold. Comparison of their random cDNA sequences to the 
GenBank database using the BLAST algorithm, well known to those skilled in the art, 
revealed that these proteins have no significant homologies to other identified proteins in 

14 



WO 01/21650 



PCT/US00/25856 



plants. The nucleotide sequences of the present homolog cDNAs are provided in SEQ ID 
NO: 1 , SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 1 1, SEQ ID 
NO: 1 3, SEQ ID NO: 1 5, SEQ ID NO: 17 and SEQ ID NO: 1 9. Other poly-ciy-isoprenoid 
synthase genes and proteins from other plants can now be identified by comparison of 
random cDNA sequences to the present cw-prenyltransferase sequences provided herein. 

The present sequences were identified by comparison to public as well as internal 
database. Strong correlation was seen between the instant sequences and the cis~ 
prenyltransferase genes and proteins isolated from Micrococcus luteus Shimizu, N., 
Koyama, T. and Ogura,K., J. Biol Chem. 273:19476-19481 (1998)) and Saccharomyces 
cerevisiae. Accordingly it is an object of the present invention to provide nucleic acid 
molecules encoding plant c/s-prenyltransferase proteins where the nucleic acid sequence is at 
least 50% identical to the bacterial undecaprenyl diphosphate synthase gene isolated from 
Micrococcus luteus where a correlation of at least 80% is preferred. Similarly the invention 
provides plant cw-prenyltransferase proteins where the amino acid sequence is at least 41% 
identical to the bacterial undecaprenyl diphosphate synthase protein isolated from 
Micrococcus luteus where a correlation of at least 70% is preferred. 

The nucleic acid fragments of the present invention may be used to isolate cDNAs 
and genes encoding a homologous prenyltransferases from the same or other plant species. 
Isolating homologous genes using sequence-dependent protocols is well known in the art. 
Examples of sequence-dependent protocols include, but are not limited to, methods of 
nucleic acid hybridization and methods of DNA and RNA amplification as exemplified by 
various uses of nucleic acid amplification technologies (e.g., polymerase chain reaction 
(PCR) or ligase chain reaction). 

: For example, .other cw-prenyltransferase genes, (and particularly undecaprenyl 
diphosphate and dehydrodolichyl diphosphate synthases) either as cDNAs or genomic 
DNAs, could be isolated directly by using all or a portion of the present nucleic acid 
fragments as DNA hybridization probes to screen libraries from any desired plant using 
methodology well known to those skilled in the art. Specific oligonucleotide probes based 
upon the present c/s-prenyltransferase sequences can be designed and synthesized by 
methods known in the art (Sambrook et al., supra). Moreover, the entire sequences can be 
used directly to synthesize DNA probes by methods known to the skilled artisan such as 
random primers, DNA labeling, nick translation, or end-labeling techniques, or RNA probes 
using available in vitro transcription systems. In addition, specific primers can be designed 
and used to amplify a part of or full-length of the present sequences. The resulting 
amplification products can be labeled directly during amplification reactions or labeled after 
amplification reactions, and used as probes to isolate full length cDNA or genomic 
fragments under conditions of appropriate stringency. 
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In addition, two short segments of the present nucleic acid fragment may be used in 
PCR protocols to amplify longer nucleic acid fragments encoding homologous genes from 
DNA or RNA. The polymerase chain reaction may also be performed on a library of cloned 
nucleic acid fragments wherein the sequence of one primer is derived from the present 
5 nucleic acid fragments, and the sequence of the other primer takes advantage of the presence 
of the polyadenylic acid tracts to the 3' end of the mRNA precursor encoding plant UPPS 
homologs. 

Alternatively, the second primer sequence may be based upon sequences derived 
from the cloning vector. For example, the skilled artisan can follow the RACE protocol 

10 (Frohman et al, Proa Natl. Acad Sci USA 85:8998 (1988)) to generate cDNAs by using 
PCR to amplify copies of the region between a single point in the transcript and the 3* or 
5' end. Primers oriented in the 3* and 5' directions can be designed from the present 
sequences. Using commercially available 3' RACE or 5' RACE systems (BRL), specific 3* 
or 5' cDNA fragments can be isolated (Ohara et al., Proc. Natl. Acad ScL t USA 86:5673 

15 (1989); Loh et al., Science 243:217 (1989)). Products generated by the 3' and 5' RACE 
procedures can be combined to generate full-length cDNAs (Frohman et al., Techniques 
1:165 (1989)). 

Finally, availability of the present nucleotide and deduced amino acid sequences 
facilitates immunological screening of cDNA expression libraries. Synthetic peptides 

20 representing portions of the present amino acid sequences may be synthesized. These 

peptides can be used to immunize animals to produce polyclonal or monoclonal antibodies 
with specificity for peptides or proteins comprising the amino acid sequences. These 
antibodies can be then be used to screen cDNA expression libraries to isolate full-length 
cDNA clones of interest (Lerner et al., Adv. Immunol 36:1 (1984); .Sambrook et al., supra). 

25 The nucleic acid fragments of the present invention may also be used to create 

transgenic plants in which the present cw-prenyltransferase protein is present at higher or 
lower levels than normal. Alternatively, in some applications, it might be desirable to 
express the present m-prenyltransferase protein in specific plant tissues and/or cell types, or 
during developmental stages in which they would normally not be encountered. The 

30 expression of full-length plant c/s-prenyltransferase cDNAs (ie., any of the sequences below 
or related sequences incorporating an appropriate in-frame ATG start codon) in a bacterial 
(e.g., £ colt), yeast (eg, Saccharomyces cerevisiae, Pichia pastoralis) or plant yields a 
mature protein capable of the synthesis of cis-polyisoprenoids from substrate IPP. The 
presence of an initiator allylic isoprenoid diphosphate (DMAPP, GPP, FPP or GGPP) 

35 enhances this activity. 

It is contemplated that transgenic plants expressing the present cw-prenyltransferase 
sequences will have altered or modulated defense mechanisms against various pathogens 
and natural predators. For example, various latex proteins are known to be antigenic and 
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recognized by IgE antibodies, suggesting their role in immunolgical defense (Yagami et al., 
Journal of Allergy and Clinical Immunology, (March, 1998) Vol. 101, No. 3, pp. 379-385. 
Additionally it has been shown that a significant portion of the latex isolated from Hevea 
brasiliensis contains chitinases/lysozymes, which are capable of degrading the chitin 
component of fungal cell walls and the peptidoglycan component of bacterial cell walls 
(Martin, M. N, Plant Physiol (Bethesda), (1991) 95 (2), 469-476). It is therefore an object 
of the present invention to provide transgenic plants having altered, modulated or increased 
defenses towards various pathogens and herbivores. 

The plant species suitable for expression of the present sequences may be (but are not 
limited to) tobacco (Nicotiana spp.), tomato (Lycopersicon spp.), potato (Solanum spp.), 
hemp (Cannabis spp.), sunflower (Helianthus spp.), sorghum (Sorghum vulgare), wheat 
(Triticum spp.), maize (Zea mays), rice (Oryza sativa), rye (Secale cereale), oats (Avena 
spp.), barley (Hordeum vulgare), rapeseed (Brassica spp.), broad bean (Vicia faba), french 
bean (Phaseolus vulgaris), other bean species (Vigna spp.), lentil (Lens culinaris), soybean 
(Glycine max), arabidopsis (Arabidopsis thaliana), guayule (Parthenium argentatum), cotton 
(Gossypium hirsutum), petunia (Petunia hybrida), flax (Linum usitatissimum) and carrot 
(Daucus car ota sativa). 

Various methods of transforming cells of higher plants according to the present 
invention are available to those skilled in the art (see EPO Pub. 0 295 959 A2 and 
0 3 18 341 A 1). Such methods include those based on transformation vectors utilizing the Ti 
and Ri plasmids of Agrobacterium spp. It is particularly preferred to use the binary type of 
these vectors. Ti-derived vectors transform a wide variety of higher plants, including 
monocotyledonous and dicotyledonous plants (Sukhapinda et al., Plant Mol. Biol. 8:209-216 
t i (1987); Potrykuset al., Mol. Gen. Genet. 199: 183.(1985)). Other transformation methods 
are available to those skilled in the art, such as direct uptake of foreign DNA constructs (see 
EPO Pub. 0 295 959 A2), techniques of electroporation (Fromm et al., Nature (London) 
319:791 (1986)) or high-velocity ballistic bombardment with metal particles coated with the 
nucleic acid constructs (Kline et al., Nature (London) 327:70 (1987)). Once transformed, 
the cells can be regenerated by those skilled in the art. 

Of particular relevance are the recently described methods to transform foreign genes 
into commercially important crops, such as rapeseed (De Block et al., Plant Physiol. 
91:694-701 (1989)), sunflower (Everett et al., Bio/Technology 5:1201 (1987)), and soybean 
(Christou et al., Proc. Natl Acad Sci. USA 86:7500-7504 (1989)). 

Overexpression of the present cw-prenyltransferase homologs may be accomplished 

by first constructing a chimeric gene in which their coding region is operably-linked to a 

promoter capable of directing expression of a gene in the desired tissues at the desired stage 

of development. For reasons of convenience, the chimeric gene may comprise promoter 

sequences and translation leader sequences derived from the same genes. 3' Non-coding 

17 



WO 01/21650 



PCT/US00/25856 



sequences encoding transcription termination signals must also be provided. The present 
chimeric genes may also comprise one or more introns in order to facilitate gene expression. 

Plasmid vectors comprising the present chimeric genes can then be constructed. The 
choice of a plasmid vector depends upon the method that will be used to transform host 
5 plants. The skilled artisan is well aware of the genetic elements that must be present on the 
plasmid vector in order to successfully transform, select and propagate host cells containing 
the chimeric gene. The skilled artisan will also recognize that different independent 
transformation events will result in different levels and patterns of expression (Jones et al., 
EMBOJ. 4:241 1-2418 (1985); De Almeida et al., Mol. Gen. Genetics 218:78-86 (1989)), 
10 and thus that multiple events must be screened in order to obtain lines displaying the desired 
expression level and pattern. Such screening may be accomplished by Southern analysis of 
DNA, Northern analysis of mRNA expression, Western analysis of protein expression, or 
phenotypic analysis. 

For some applications it may be useful to direct the cw-prenyltransferase protein to 

15 different cellular compartments or to facilitate their secretion from the cell. The chimeric 

genes described above may be further modified by the addition of appropriate intracellular or 
extracellular targeting sequence to their coding regions. These include chloroplast transit 
peptides (Keegstra et al., Cell 56:247-253 (1989)), signal sequences that direct proteins to 
the endoplasmic reticulum (Chrispeels et al., Ann. Rev. Plant Phys. Plant Mol. 42:21-53 

20 (1991)), and nuclear localization signal (Raikhel et al., Plant Phys. 100:1627-1632 (1992)). 
While the references cited give examples of each of these, the list is not exhaustive and more 
targeting signals of utility may be discovered in the future. 

It may also be desirable to reduce or eliminate expression of the m-prenyltransferase 
genes in plants for some applications. In order to accomplish thiSj chimeric genes designed* 

25 for antisense or co-suppression of cw-prenyltransferase homologs can be constructed by 
linking the genes or gene fragments encoding parts of these enzymes to plant promoter 
sequences. Thus, chimeric genes designed to express antisense RNA for all or part of a 
UPPS homolog can be constructed by linking the c/s-prenyltransferase homolog genes or 
gene fragments in reverse orientation to plant promoter sequences. The co-suppression or 

30 antisense chimeric gene constructs could be introduced into plants via well known 

transformation protocols wherein expression of the corresponding endogenous genes are 
reduced or eliminated. 

The present cw-prenyltransferase homolog proteins may be produced in heterologous 
host cells, particularly in the cells of microbial hosts, and can be used to prepare antibodies 

35 to the proteins by methods well known to those skilled in the art. The antibodies would be 

useful for detecting the present c/s-prenyltransferase proteins in situ in cells or in vitro in cell 

extracts. Preferred heterologous host cells for production of the present cw-prenyltransferase 

proteins are microbial hosts. Microbial expression systems and expression vectors 
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containing regulatory sequences that direct high level expression of foreign proteins are well 
known to those skilled in the art. Any of these could be used to construct a chimeric gene 
for production of the present m-prenyltransferase homologs. This chimeric gene could then 
be introduced into appropriate microorganisms via transformation to provide high level 
5 expression of the present c/s-prenyltransferase proteins. 

Microbial host cells suitable for the expression of the present cw-prenyltransferase 
proteins include any cell capable of expression of the chimeric genes encoding these 
proteins. Such cells will include both bacteria and fungi including, for example, the yeasts 
(e.g., Aspergillus, Saccharomyces, Pichia, Candida and Hansenula), members of the genus 

10 Bacillus as well as the enteric bacteria (e.g., Escherichia, Salmonella and Shigella), Methods 
for the transformation of such hosts and the expression of foreign proteins are well known in 
the art and examples of suitable protocols may be found In Manual of Methods for General 
Bacteriology; Gerhardt et al., Eds.; American Society for Microbiology: Washington, DC, 
1994 or In Biotechnology: A Textbook of Industrial Microbiology, 2nd Edition, Brock, 

15 T. D., Ed.; Sinauer Associates, Inc.: Sunderland, MA, 1989. 

Vectors or cassettes useful for transforming suitable microbial host cells are well 
known in the art. Typically the vector or cassette contains sequences directing transcription 
and translation of the relevant gene, a selectable marker, and sequences allowing 
autonomous replication or chromosomal integration. Suitable vectors comprise a region 5' 

20 of the gene which harbors transcriptional initiation controls and a region 3' of the DNA 

fragment which controls transcriptional termination. It is most preferred when both control 
regions are derived from genes homologous to the transformed host cell, although such 
control regions need not be derived from the genes native to the specific species chosen as a 
- produption.host. . ; ;; . . , .; .. .-■ - -\- .-. - . , . ~ 

25 Initiation control regions or promoters useful to drive expression of the genes 

encoding the m-prenyltransferase proteins in the desired host cell are numerous and familiar 
to those skilled in the art. Virtually any promoter capable of driving these genes is suitable 
for the present invention including but not limited to CYC1, HIS3, GAL1, GAL 10, ADH1, 
PGK, PHOS, GAPDH, ADC1, TRP1, URA3, LEU2, ENO, TPI (useful for expression in 

30 Saccharomyces); AOX1 (useful for expression in Pichia); and lac, trp, 1P L , IPr, T7, tac, and 
trc (useful for expression in E. coll). Termination control regions may also be derived from 
various genes native to the preferred hosts. Optionally, a termination site may be 
unnecessary; however, it is most preferred if included. 

Additionally, the present cw-prenyltransferase proteins can be used as targets to 

35 facilitate the design and/or identification of inhibitors of c/s-prenyltransferase homologs that 

may be useful as herbicides or fungicides. This could be achieved either through the rational 

design and synthesis of potent functional inhibitors that result from structural and/or 

mechanistic information that is derived from the purified present plant proteins, or through 
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random in vitro screening of chemical libraries. It is anticipated that significant in vivo 
inhibition of any of the cw-prenyltransferase homolog proteins described herein may 
severely cripple cellular metabolism and likely result in plant (or fungal) death. 

All or a portion of the nucleic acid fragments of the present invention may also be 
5 used as probes for genetically and physically mapping the genes that they are a part of, and 
as markers for traits linked to expression of the present m-prenyltransferase homologs. 
Such information may be useful in plant breeding in order to develop lines with desired 
phenotypes. For example, the present nucleic acid fragments may be used as restriction 
fragment length polymorphism (RFLP) markers. Southern blots (Sambrook et al., supra) of 
1 0 restriction-digested plant genomic DNA may be probed with the nucleic acid fragments of 
the present invention. The resulting banding patterns may then be subjected to genetic 
analyses using computer programs such as MapMaker (Lander et al., Genomics 1:174-181 
(1987)) in order to construct a genetic map. In addition, the nucleic acid fragments of the 
present invention may be used to probe Southern blots containing restriction endonuclease- 
1 5 treated genomic DNAs of a set of individuals representing parent and progeny of a defined 
genetic cross. Segregation of the DNA polymorphisms is noted and used to calculate the 
position of the present nucleic acid sequence in the genetic map previously obtained using 
this population (Botstein et al., Am. J. Hum. Genet 32:314-331 (1980)). 

The production and use of plant gene-derived probes for use in genetic mapping is 
20 described by Bematzky et al. (Plant Mol. Biol Reporter 4:37-41 (1986)). Numerous 
publications describe genetic mapping of specific cDNA clones using the methodology 
outlined above or variations thereof. For example, F2 intercross populations, backcross 
populations, randomly mated populations, near isogenic lines, and other sets of individuals 
may be used for mapping. Such methodologies are well known to those skilled in the art. 
25 Nucleic acid probes derived from the present nucleic acid sequences may also be 

\£§ used for physical mapping (i.e., placement of sequences on physical maps; see Hoheisel 

et al., Nonmammalian Genomic Analysis: A Practical Guide; Academic Press, 1996; 
||j pp. 3 1 9-346 and references cited therein). 

] In another embodiment, nucleic acid probes derived from the present nucleic acid 

30 sequence may be used in direct fluorescence in situ hybridization (FISH) mapping. 

Although current methods of FISH mapping favor use of large clones (several to several 

^ hundred kb), improvements in sensitivity may allow performance of FISH mapping using 

. .;':! 

shorter probes. 

A variety of nucleic acid amplification-based methods of genetic and physical 
35 mapping may be carried out using the present nucleic acid sequences. Examples include 
allele-specific amplification (Kazazian et al., J. Lab. Clin. Med. 1 14:95-96 (1989)), 
polymorphism of PCR-amplified fragments (CAPS; Sheffield et al., Genomics 16:325-332 
(1993)), allele-specific ligation (Landegren et al., Science 241 :1077-1080 (1988)), nucleotide 
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extension reactions (Sokolov et al., Nucleic Acid Res. 18:3671 (1990)), Radiation Hybrid 
Mapping (Walter et al., Nature Genetics 7:22-28 (1997)) and Happy Mapping (Dear et al., 
Nucleic Acid Res. 17:6795-6807 (1989)). For these methods, the sequence of a nucleic acid 
fragment is used to design and produce primer pairs for use in the amplification reaction or 
5 in primer extension reactions. The design of such primers is well known to those skilled in 
the art. In methods using PCR-based genetic mapping, it may be necessary to identify DNA 
sequence differences between the parents of the mapping cross in the region corresponding 
to the present nucleic acid sequence. This, however, is generally not necessary for mapping 
methods. 

10 Loss of function-mutant phenotypes may be identified for the present cDNA clones 

either by targeted gene disruption protocols or by identifying specific mutants for these 
genes contained in a population of plants carrying mutations in all possible genes (e.g., 
Ballinger et al., Proc. Natl. Acad. Sci. USA 86:9402 (1989); Koes et al., Proc. Natl Acad. 
Sci. USA 92:8149 (1995); Bensen et al., Plant Cell 7:75 (1995)). The latter approach may be 

15 accomplished in two ways. First, short segments of the present nucleic acid fragments may 
be used in polymerase chain reaction protocols in conjunction with a mutation tag sequence 
primer on DNAs prepared from a population of plants in which Mutator transposons or some 
other mutation-causing DNA element has been introduced (see Bensen, supra). The 
amplification of a specific DNA fragment with these primers indicates the insertion of the 

20 mutation tag element in or near the plant gene encoding the m-prenyltransferase protein. 

Alternatively, the present nucleic acid fragment may be used as a hybridization probe against 
PCR amplification products generated from the mutation population using the mutation tag 
sequence primer in conjunction with an arbitrary genomic site primer, such as that for a 
restriction enzyme , site-anchored synthetic adaptor. With either method,, a plant containing a 

25 mutation in the endogenous gene encoding a cw-prenyltransferase protein can be identified 
and obtained. This mutant plant can then be used to determine or confirm the natural 
function of the c/s-prenyltransferase gene product. 

The present invention is further defined in the following Examples, in which all parts 
and percentages are by weight and degrees are Celsius, unless otherwise stated. It should be 

30 understood that these Examples, while indicating preferred embodiments of the invention, 
are given by way of illustration only. From the above discussion and these Examples, one 
skilled in the art can ascertain the essential characteristics of this invention, and without 
departing from the spirit and scope thereof, can make various changes and modifications of 
the invention to adapt it to various usage and conditions. 

35 EXAMPLES 

GENERAL METHODS 

Standard recombinant DNA and molecular cloning techniques used here are well 

known in the art and are described by Sambrook et al., Molecular Cloning: A Laboratory 
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ManuM, 2 nd Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor (1989) 
(hereinafter "Sambrook et al."); and by T. J. Silhavy, M. L. Bennan, and L. W. Enquist, 
Ex periments with Gene Fusions , Cold Spring Harbor Laboratory Press, Cold Spring, NY 
(1984) and by Ausubel et al., Current Protocols in Molecular Biology , pub. by Greene 
5 Publishing Assoc. and Wiley-Interscience (1987). 

Nucleotide and amino acid percent identity and similarity comparisons were made 
using the GCG suite of programs, applying default parameters unless indicated otherwise. 

The meaning of abbreviations is as follows: "sec" means second(s), "min" means 
minute(s), " h" means hour(s), "d" means day(s), " uL" means microliter, "mL" means 
10 milliliters, "L" means liters, "mM" means millimolar, "M" means molar, and "mmoi" 
means millimole(s). 

EXAMPLE 1 

Composition of cDNA Libraries Used for Identification of cDNA Clones from Plant Species 

Encoding m-Prenyltransferase Homologs 
1 5 cDNA libraries representing mRNAs from wheat, grape, soybean, rice, African daisy, 

rubber tree latex and marigold tissues were prepared. The characteristics of the libraries are 
described in Table 1 . cDNA libraries were prepared by any one of several methods. The 
cDNAs were introduced into plasmid vectors by first preparing the cDNA libraries in Uni- 
ZAP XR vectors according to the manufacturer's protocol (Stratagene Cloning Systems, La 
20 Jolla, CA). The Uni-ZAP XR libraries were converted into plasmid libraries according to the 
protocol provided by Stratagene. Upon conversion, cDNA inserts were contained in the 
plasmid vector pBluescript. In an alternate approach the cDNAs were introduced directly 
into precut Bluescript II SK(+) vectors (Stratagene) using T4 DNA ligase (New England 
Biolabs), followed by transfection into DH10B cells according to the manufacturer's . 
25 protocol (GIBCO BRL Products). Once the cDNA inserts were in plasmid vectors, plasmid 
DNAs were prepared from randomly picked bacterial colonies containing recombinant 
pBluescript plasmids, or the insert cDNA sequences were amplified via polymerase chain 
gj reaction using primers specific for vector sequences flanking the inserted cDNA sequences. 

Amplified insert DNAs or plasmid DNAs were sequenced in dye-primer sequencing 
30 reactions to generate partial cDNA sequences (expressed sequence tags or "ESTs"; see 
Adams et al., Science 252:1651-1656 (1991). The resulting ESTs were analyzed using a 
Perkin Elmer Model 377 fluorescent sequencer. 
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TABLE 1 



cDNA Libraries from Plants 



Library 



Species and Tissue 



dms2c 



African daisy (Dimorphotheca sinuata) developing seeds 

pot marigold {Calendula officinalis) developing seeds 

para rubber tree (Hevea brasiliensis, PR255) latex tapped in 2 nd day 
of two day tapping cycle 

Grape (Vitis sp.) developing bud 

rice (Oryza sativa L.) fifteen day leaf (normalized) 

rice (Oryza sativa L.) root of two week old developing seedling 

soybean (Glycine max L.) of two week old developing seedlings 
treated with water 

wheat (Triticum aestivum L.) developing kernel, thirty days after 
anthesis 



ecslc 



ehb2c 



Vdblc 



rlOn 



rrl 



sll 



wdk5c 



EXAMPLE 2 



Characterization of ESTs 



ESTs encoding candidate c/5-prenyltransferases were identified by conducting 
BLAST (Basic Local Alignment Search Tool; Altschul et al., J. Mol Biol 215:403-410 
(1993); see also www.ncbi.nlm.nih.gov/BLAST/) searches for similarity to sequences 
contained in the BLAST "nr" database (comprising all non-redundant GenBank CDS 
translations, sequences derived from the 3-dimensional structure Brookhaven Protein Data 
Bank, the last major release of the SWISS-PROT protein sequence database, EMBL and 
DDBJ databases). The cDNA sequences obtained in Example 3 were analyzed for similarity 
to all publicly available DNA sequences contained m the " nr" database using theBLASTN 
algorithm provided by the National Center for Biotechnology Information (NCBI). The 
DNA sequences were translated in all reading frames and compared for similarity to all 
publicly available protein sequences contained in the "nr" database using the BLASTX 
algorithm (Gish, W. and States, D. J. Nature Genetics 3:266-272 (1993)) provided by the 
NCBI. For convenience, the P-value (probability) of observing a match of a cDNA sequence 
to a sequence contained in the searched databases merely by chance as calculated by BLAST 
are reported herein as "pLog" values, which represent the negative of the logarithm of the 
reported P-value. Accordingly, the greater the pLog value, the greater the likelihood that the 
cDNA sequence and the BLAST "hit" represent homologous proteins. 



Identification and Characterization of cDNA Clones for cfc-Prenvltransferases 
cDNAs from the libraries listed in Table 1 were identified as m-prenyltransferase 
homologs based on interrogation of the database described in Examples 1 and 2. cDNAs 
were thus identified by a number of methods, including the following: 1) keyword searches 



EXAMPLE 3 
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(e.g., ''undecaprenyl"), 2) searches of the database using the TBLASTN algorithm provided 
by the National Center for Biotechnology Information (NCBI) and short fragments of 
conserved sequence present in bacterial undecaprenyl synthases, and 3) identification of 
further homo logs of cDNAs discovered by 1 and 2 within the in-house database using the 
5 FASTA program. An alignment of the deduced amino acid sequence of the E. coli 
undecaprenyl pyrophosphate synthase gene with a number of other publicly-available 
sequences from bacteria, yeast (Saccharomyces cerevisiae) and one eukaryote 
{Caenorhabditis elegans) has been published (Apfel et al„ J. BacterioL 81:483-492 (1999)). 
This alignment revealed five conserved domains. One of these (Domain I) is present at the 

10 5' end of the ORFs of these genes, and consists of the following sequence: 

HXXMDGNXRXA (X = any amino acid; (SEQ ID NO:21)). Another (Domain V) is 
present towards the 3' end of the ORFs, and consists of the following sequence: 
DLXIRTXGEXRXSNFLLWQXXYXE (where X - any amino acid; (SEQ ID NO:22)). 
These sections of conserved sequence are likely to be diagnostic for the cw-preny transferase 

1 5 family of enzymes, and were used in the aforementioned TBLASTN searches. 

Further homologs of cDNAs discovered by the first and second method within the in- 
house database were identified as sequences homologous by FASTA alignment with a 
specified sequence, either restricted to the same library, or across all libraries or across a 
library group. The cDNAs identified by these means are listed in Table 2. 

20 

TABLE 2 

cDNAs Identified as c/s-Prenyltransferase Homologs 



Sequence identification number (SID) Source 





dms2c.pk005,c7 


African Daisy 




ecslc.pk009.pl9 


pot marigold 




ehb2c.pk001.il0 


Hevea brasiliensis 


- r -J 

^2 


ehb2c.pk001.di7 


Hevea brasiliensis 




ehb2c.pk001.ol8 


Hevea brasiliensis 




Vdblc.pk001.k23 


grape 




rl0n.pkll7.i23 


rice 




rrl.pk0050.h8 


rice 




sll.pk0128.h7 


soybean 


'* A 
- • -l 


wdk5c.pk005.f22 


wheat 



Comparison of the nucleotide (SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID 
25 NO:7, SEQ ID NO:9, SEQ ID NO:l 1 , SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17 and 
SEQ ID NO: 19) and deduced amino acid (SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, 
SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16, SEQ ID 
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NO: 18 and SEQ ID NO:20) sequences of these ESTs with those of a representative bacterial 
m-prenyltransferase {Micrococcus luteus UPPS; Shimizu, N., Koyama, T. and Ogura, K., J. 
Biol Chem. 273:19476-19481 (1998)) show them to exhibit >45% identity in nucleotide 
sequence and >30% identity in amino acid sequence. Table 3 lists the comparison of the cis~ 
prenyltransferase sequences isolated from wheat, grape, soybean, rice, African daisy, rubber 
tree and pot marigold with the sequence of the Micrococcus luteus UPPS. Figure 2 shows 
an alignment of the nucleotide sequence within the coding regions of these cDNAs with 
those of Micrococcus luteus UPPS and two yeast c/'s-prenyltransferase genes, rer2 
(GenBank ACC. NO. AB013497) and srtl (GenBank ACC. NO. AB013498) which 
indicates the extent of homology between the primary sequence of these cis- 
prenyltransferase genes from diverse species. 

TABLE 3 

Comparison of Grape, Rice, Soybean, Rubber tree and African Daisy Sequences 
Against the Sequence of Micrococcus luteus Undecaprenyl Pyrophosphate Synthase 

% Identity 1 Similarity Identified to M. luteus Gene 5 

cDNA/deduced BLAST 



protein sequence 


NA 2 


AA 2 


algorithm 


Score^ 


pLofi 4 


dms2c.pk005.c7 


50.13 


39.024 


Xnr 


162 


10.57 


ecslc.pk009.pl 9 


50.40 


38.938 








ehb2c.pk001.il0 


46.00 


33.603 


Xnr 


71 


1.48 


ehb2c.pk001.dl7 


46.133 


33.603 


Xnr 


161 


10.46 


ehb2c.pk001.ol8 


49.464 


32.129 








vdblc.pk001.ol8 


46.559 


34.413 








rl0n.pkH7.i23 


45.652 


33.186 


Xnr 


152 


• 9.41 


rrl.pk0050.h8 


45.699 


34.694 








sll.pk0128.h7 


50.133 


41.564 








wdk5c.pk005.f22 


43.067 


38.00 









1 Comparison made using GCG GAP program, applying default values. 

2 AA is the abbreviation for amino acid sequence; NA is the abbreviation for nucleotide sequence. 
^Score is the value assigned to a match between two sequences by the BLAST program. 

4 pLog is the negative of the logarithm of the reported P-value, the probability of observing a 

match of a cDNA sequence to a sequence contained in the searched databases merely by chance as calculated 
by BLAST. 

5 Given for those cDNAs where this similarity was detected by the initial BLAST search. 
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EXAMPLE 4 

Analysis of Deduced Amino Acid Sequence of cDNAs Identified as 
m-Prenvltransferase Homologs in Plants 
The plant cDNAs identified as described above were translated and the deduced 
5 amino acid sequences compared one to another using the GCG GAP program. Gap 

considers all possible alignments and gap positions between two sequences and creates a 
global alignment that maximizes the number of matched residues and minimizes the number 
and size of gaps. A scoring matrix is used to assign values for symbol matches. In addition, 
a gap creation penalty and a gap extension penalty are required to limit the insertion of gaps 
10 into the alignment. Gap uses the alignment method of Needleman and Wunsch (J. Mol Biol 
48:443-453 (1970)). It is clear from this analysis (Table 4) that these sequences encode 
polypeptides with a minimum of 27.826% identity. The highest identities revealed by this 
analysis are between sequences from the same species, with two rice sequences exhibiting 
90.668% identity and two rubber latex sequences 98.282% identity. The highest identity 
15 between sequences from different species was exhibited by the rice and grape sequences. In 
addition, alignment of the deduced amino acid sequence of these cDNAs together (Figure 3) 
and with bacterial and yeast cw-prenyltransferases (Figure 4) using the CLUSTALW 
program within the VECTOR NTI suite of programs reveals the presence of the conserved 
domains characteristic of this gene family (referred to in Example 2). 

20 

TABLE 4 

Identity Comparison Using the GAP Program of the Deduced Amino Acid 
Sequences from Plant c/s-Prenyltransferases 



SEQID 


2 


4 


6 


8 


10 


12 


14 


16 


18 


20 


2 


j| 


48.684 


31.907 


33.858 


31.923 


52.669 


33.043 


30.545 


58.537 


50.965 


4 


48.684 


100 


30.701 


30.702 


33.333 


46.222 


33.186 


33.186 


48.246 


45.133 


6 


31.907 


30.701 


100 


99.655 


78.547 


32.296 


47.773 


46.182 


33.588 


31.679 


8 


33.858 


30.702 


99.655 


100 


78.201 


32.296 


47.773 


46.182 


33.588 


31.679 


10 


31.923 


33.333 


78.547 


78.201 


100 


29.502 


46.154 


44.891 


32.067 


30.943 


12 


52.669 


46.222 


32.296 


32.296 


29.502 


100 


33.478 


31.250 


53.398 


, 48.450 


14 


33.043 


33.186 


47.773 


47.773 


46.154 


33.478 


100 


100 


32.05 1 


37.627 


16 


30.545 


33.186 


46.182 


46.182 


44.891 


31.250 


100 


100 


29.643 


30.916 


18 


58.537 


48.246 


33.588 


33.588 


32.061 


53.398 


32.05 1 


29.643 


100 


50.775 


20 


50.965 


45.133 


90.943 


31.679 


30.943 


48.450 


37.627 


30.916 


50.775 


100 



26 



WO 01/21650 



PCT/US00/25856 



EXAMPLE 5 

Transformation and Expression of Hevea 
m-Prenyltransferase in Dandelion Plants 

A chimeric gene comprising the Hevea m-prenyltransferase gene (SEQ ID NO: 5) in 

5 sense orientation is constructed by polymerase chain reaction (PCR) of the gene using 

appropriate oligonucleotide primers. Cloning sites (Ecorl and Kpnl) are incorporated into 

the oligonucleotides to provide proper orientation of the DNA fragment when inserted into 

the digested vector pML82. The binary vectors pML82 are transferred by a freeze/thaw 

method (Holsters etal., Mol Gen. Genet. 163:181-187 (1978)) to the Agrobacterium 

1 0 tumefaciens strain LBA4404 and Agrobacterium rhizogenes ATCC 1 5834 (Hockema et al., 
Nature 303:179-180 (1983)). 

Dandelion plants are transformed by co-cultivation of leaf and petiole explants with 
disarmed Agrobacterium tumefaciens strain LBA4404 and Agrobacterium rhizogenes strain 
ATCC 15834 carrying the appropriate binary vector. 

1 5 Dandelion leaf and petiole explants from greenhouse are sterilized by stirring in 70% 

ethanol for 10 min and transferring to 5% Chlorox™, 0.01% Triton-X 100 for 30 min, and 
then rinsing thoroughly with sterile distilled water. Liquid cultures of Agrobacterium for 
plant transformation are grown overnight at 28 °C in Minimal A medium containing 
100 mg/L kanamycin. The bacterial cells are pelleted by centrifugation and resuspended in 

20 liquid MS medium containing 1 mg/L BAP and 0.2 mg/L NAA to a density of A 6u q=0.5, 
leaf and petiole explants are inoculated with the bacteria suspension for 10 min, blotted dry 
with sterile filter paper, then co-cultivated on solidified MS medium for two to four days (in 
case of the explants and strain LBA440 co-cultivation, use MS medium containing 0.5 mg/L 
BAP and 0.2 mg/L NAA). The co-cultivations are terminated by ^transferring the explants 

25 onto the same medium plus 200 mg/L cefotaxime and 50 mg/L kanamycin to kill the 
Agrobacteria, and to select for transformed plant cell growth. 

The explants inoculated with LBA4404 strain are maintained at 27°C under cool 
white fluorescent lamps with a 16/8 h light/dark photoperiod. After three to four weeks, 
excised shoots are transferred onto rooting medium (1/2 MS plus 0.2 mg/L NAA) containing 

30 the same concentrations of antibiotics as above. Once the transformed plants have 

established their root systems, they are transferred directly into wet Metro-Mix 350 soilless 
potting medium. The pots are covered with plastic bags which are removed when the plants 
are clearly growing (after about ten days). 

The explants inoculated with ATCC 15834 strain are incubated at 27°C under 

35 continuous dark. After ten to fifteen days, excised roots were transferred to the same plates 
for large production of the transformed roots. 
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EXAMPLE 6 

Expression of Plant cfc-Prenyltransferase in Microbial Cells 
and Purification of Gene Product 
Example 6 illustrates the expression of isolated full length genes encoding cis- 
5 prenyltransferase proteins in E. coli > using as an example the expression of clone 
ehb2c.pk001.ol8. 

Plasmid DNA from ehb2c.pk001.ol8 is purified using QIAFilter cartridges (Qiagen 
Inc., 9600 De Soto Avenue, Chatsworth, CA) according to the manufacturer's instructions. 
Sequence is generated on an ABI Automatic sequencer using dye terminator technology 
10 (U.S. 5,366,860; EP 272007) using a combination of vector and insert-specific primers. 
Sequence editing is performed in either Vector NTI, DNAStar, or the Wisconsin GCG 
program {vide supra). 

cDNA from the full length clone ehb2c.pk001.ol8 encoding the instant cis- 
prenyltransferase enzyme is amplified with specific PCR primers designed to the 5* and 
15 3' ends of the coding region and containing appropriate restriction enzyme digestion sites. 
The amplified DNA is inserted into the vector pET28b by ligation into restriction sites 
suitable for expression under the control of the Tllac promoter according to the 
manufacturer's instructions (Novagen Inc., 597 Science Drive, Madison, WI). The vector is 
then used to transform BL21(DE3) competent E, coli hosts, and selected on LB agar plates 
20 containing 50 jig/mL kanamycin. Colonies arising from this transformation are grown 
overnight at 37°C in Lauria Broth to an OD 60 o of approximately 0.5, and induced with 

50 mM IPTG and allowed to grow for an additional 4.5 h. The culture is harvested, 
resuspended in buffer, lysed with a French press and cleared by centrifugation at 20,000 x g. 
Centrifugation of the supernatant after 20,000 x g centrifugation at 100,000 x g for l h 
25 yielded a membrane fraction, which is resuspended in buffer to approximately 7 mg 
£j protein/mL. Proteins in this purified membrane fraction are examined on 4-12% SDS-PAGE 

Gels (Novex, 1 1040 Roselle Street, San Diego, CA) after staining with Gelcode reagent 
(Pierce, P.O. Box 117, Rockford, IL). By comparison of the stained gel with one prepared 
from similar preparations from E. coli cells not expressing the putative cw-prenyltransferase, 
30 the protein corresponding to ehb2c,pk00 1 .o 1 8 (molecular mass 34,044 Daltons) is present at 
a significant level in this purified membrane fraction. Isolation of membranes from 
microbial hosts containing expressed c/s-prenyltransferase proteins as described in this 
example, or further purification (e.g., by chromatographic means following solubilization of 
the protein) provides sufficient enzyme protein for analysis by biochemical, chemical or 
35 physicochemical means. 
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EXAMPLE 7 

Expresson of Plant ds-Prenvltransferases in Arabidopsis thaliana 
Chimeric genes comprising Hevea, rice and soybean c/s-prenyltransferases (SEQ ID 

NO:9, 15 and 17, respectively) in sense orientation were constructed by polymerase chain 

reaction (PCR) from plasmids containing the Hevea, rice or soybean c/'j-prenyltransferase 

homologs, for expression in Arabidopsis thaliana. 

The Hevea DNA (designated Hpt3) was amplified by PCR from clone 

ehb2c.pk001.ol8, using oligonucleotide primers Hpt3/Xba I (5- 

GCTCTAGAGAAGGTTAAGTCAGTTTAGCATCG-3 1 ) (SEQ ID NO:29), and Hpt3/Kpn I 
(5 , -GGGGTACCTTATTr^AAATATTCCTTATGCTTCTCC-3 , ) (SEQ ID NO:30) The 
amplified Hpt3 cDNAs were digested with Xbal and Kpnl and separated on an agrose gel. 
The DNA fragment was isolated and purified using a QIAguick Gel Extraction Kit according 
to the manufacture's instructions (Qiagen, USA). The purified DNA fragment was cloned 
into the corresponding sites of the binary vector pBI-35S (vide infra). 

The rice and soybean DNAs were similarly isolated by PCR. For these clones, 
BamHI and SacI cloning sites were incorporated into the oligonucleotide primers to provide 
proper orientation of the DNA fragment when inserted into the binary vector pGV827. The 
rice homolog was amplified from clone rrl .pk0050.h8 using primers JK1 (5'- 
GTGGATCCATGCTTGGCTCACTTATG-3 ') (SEQ ID NO:31)and JK2 (5'- 
TTGAGCTCTATCTCC TCCCAGGG AGG-3 ') (SEQ ID NO:32) and the soybean 
homologue was amplified from clone sll.pk0128.h7 using primers JK3 (5- 
ACGGATCCATGTTCTCGTTAAGACTCC-3') (SEQ ID NO:33) and JK4 (5'- 
TCGAGCTCTTATGAATGTCGACCACC-3') (SEQ ID NO:34). PCR products were 
cloned into the pGEM T-easy vector using a TA-cloning kit (Promega Corporation, 2800 
Woods Hollow Road, Madison, WI) and these plasmids were then transformed into K coli. 

In addition to the cfc-prenyitransferase genes identified in in-house databases, several 
Arabidopsis thaliana genomic DNA fragments containing putative cw-prenyl transferase 
gene sequences were identified in public databases by conducting BLAST searches using the 
sequences of bacterial and yeast c/s-prenyl transferases essentially as outlined in Example 3. 
One gene, designated Apt5 (SEQ ID NO:37) from Arabidopsis thaliana chromosome 5 
genomic DNA (GenBank accession number AB01 1483), contains an 813 nt open reading 
frame with no intron sequences which encodes a protein with 271 amino acids and extensive 
homology to the microbial and plant m-prenyltransferase sequences described in 
Examples 3 and 4. It was decided to include this gene in our arabidopsis transformation 
experiments to determine the effect of overexpression of an endogenous gene. The Apt5 
gene (SEQ ID NO:37) was cloned by PCR amplification using Arabidopsis thaliana 
genomic DNA as a template. Primers were designed to include specific restriction sites at 
each end to facilitate in cloning. The Primers used were Apt5/Xbal (5'- 
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CTAGTCTAGAATCTCCCCTCCGATAACCAAAAAATCC-3') (SEQ ID NO:35 )and 
Apt5/Kpnl (S'-GGGGTACCTAGGGTTTAACTTAGAAACTATTTAG-S') (SEQ ID 
NO:36). The amplified Apt5 gene (SEQ ID NO:37) was digested with Xbal and Kpnl and 
separated on an agrose gel. The DNA fragment, ca. 850 bp in length, was isolated and 
5 purified using a QIAguick Gel Extraction Kit according to the manufacture's instructions 
(Qiagen, USA). The purified DNA fragments were cloned into a pBluescript vector 
according to manufacturer's instructions (Stratagene, 11011 North Torry Pines Road, 
LaJoIla, CA). 

To verify integrity of the amplified DNAs, plasmids were isolated and purified using 

10 QIAFilter cartridges (Qiagen Inc., 9600 De Soto Avenue, Chatsworth, CA) according to the 
manufacturer's instructions. Sequence was generated on an ABI Automatic sequencer using 
dye terminator technology (U.S. 5,366,860; EP 272007) using a combination of vector- 
specific primers. Sequence editing was performed in either Vector NTI, DNAStar, or the 
Wisconsin GCG program (vide supra). 

15 The plasmid, pBI-35S, containing Hpt3 gene was transformed into Argobacterium 

tumefaciens strain C58 using a freeze-thaw method (Holsters et al., Mol. Gen. Genet. 
1 63 : 1 8 1 - 1 87 (1 978)). Arabidopsis thaliana plants were transformed via Agrobacterium- 
mediated transformation (Clough S. J., Bent A. F.; Plant Journal 1998 Dec; 16(6): 735-43). 
The plasmids encoding rice and soybean c/s-prenyltransferases were digested with 

20 BamHI and Sad and the cDNA fragments encoding the instant cw-prenyltransferases were 
isolated by agarose gel purification. The fragments were ligated into a derivative of the 
binary vector pBIN19 (Frisch, R.A. et al (1995) Complete sequence of the binary vector 
BIN 19. Plant Molecular Biology 27, 405-409) containing a 35S cauliflower mosaic virus 
- promoter and the nopaline synthase 3' translation termination sequence (nos) with - 

25 appropriate restriction sites. The resulting rice and soybean gene expression constructs were 
termed 35S:: rrl and 35S::sll, respectively. These plasmids were transformed into E. coli 
and the integrity of the binary vectors was confirmed by plasmid isolation and restriction 
enzyme digestion as described above. The plasmids were then transformed into the 
Agrobacierium tumefaciens strain C58CI by a freeze/thaw method (Holsters et al., Mol Gen. 

30 Genet. 1 63 : 1 8 1 -1 87 (1978)). Agrobacterium lines bearing the binary vector constructs were 
selected using PCR and used to transform Arabidopsis thaliana using the floral dip method 
(Clough S. J., Bent A. F.; Plant Journal 1998 Dec; 16(6): 735-43). 

A binary vector, pBI-35S, was constructed for expression of the Apt5 gene (SEQ ID 
NO:37) in plants by ligating an 800 bp Hind III-Xba I CaMV 35 promoter DNA fragment 

35 (Guilley H, Dudley R. K., Jonard G, Balazs E, Richards K. E. (1982) Transcription of 
Cauliflower mosaic virus DNA: detection of promoter sequences, and characterization of 
transcripts, Cell 30(3):763-73) into the corresponding sites of the vector pBIB/NPT (Detlef 
Becker (1990) Binary vectors which allow the exchange of plant selectable mekers and 
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reporter genes. Nucleic Acids Research 1 8(1):203) to yield the binary vector pBI-35S. The 
Xba I-Kpn I DNA fragment encoding the Apt5 gene (SEQ ID NO:37) was then cloned into 
the pBI-35S vector, yielding the construct 35S::Apt5, This construct was transformed into 
Argobacterium tumefaciens strain C58 using a freeze-thaw method (Holsters et al., Mol. 
Gen. Genet. 163:181-187 (1978)). Arabidopsis thaliana plants were transformed via 
Agrobacterium-mediated transformation (Clough S. J., Bent A. F., Plant Journal 1998 Dec; 
16(6): 735-43). 

The seeds produced from infected plants were plated on agar plates containing 
100 ug/ml kanamycin. Arabidopsis plants resistant to kanamycin were selected and planted 
into soil. 

EXAMPLE 8 

Analysis of the Polyprenol Profile of Transgenic Plants 
Heterozygous transgenic plants expressing either the rice, Hevea brasiliensis, 
Arabidopsis or soybean cw-prenyltransferase homologs described in Example 7 were grown 
at 19°C, with 18 hours of light/day. Rosette leaves were harvested, frozen in liquid nitrogen 
and then lyophilized. The dried leaf material was extracted overnight in 2 ml of 
chloroform:methanol (2:1 v/v); geranylgeraniol was added at 1 ug per 10 mg dry weight to 
act as an internal standard. The organic extracts were washed with 400 ul of water and the 
aqueous phase discarded. The extracts were then dried down under a stream of nitrogen, 
and, after redissolving in 1 ml of 2MKOH/50% methanol, saponified by heating at 70°C for 
2 hours. The saponification mixtures were extracted twice with hexane. A volume of these 
hexane extracts equivalent to 10 mg (dry weight) of leaf tissue was then analyzed by high- 
pressure combined liquid chromatography-mass spectrometry (LC_MS), using a Hewlett- 
Packard 1 100 Series LC-MS in atmospheric pressure chemical ionization (APCI) mode. 

Chromatography was conducted using a Zorbax C18 (2.1 x 150 mm; 5 um) reverse- 
phase column with methanol :isopropanol: water (12:8:1) at a flow rate of 0.25 ml/min as 
initial solvent. Polyprenols were eluted by applying a gradient of isopropanokhexane (1 :4), 
and elution monitored at 210 nm. Polyprenols were identifed by comparing their elution 
time and mass spectrum with those of authentic standards (Sigma, St. Louis, MO). 

The data from these analyses indicated that expression of the soybean clone 
sll.pk0128.h7 (SEQ ID NO: 17) and overexpression of the arabidopsis cw-prenyltransferase 

Apt5 caused significant alteration of the polyprenol composition of leaves of the transgenic 
arabidopsis plants. In both of these cases, dodecaprenol (a 60-carbon polyprenol (Cg 0 ), 

composed of 12 isoprene units) was undetectable either by examination of the diode array 
detector (DAD; Figure 5) response or by selective ion monitoring of the mass detector data 
(Table 5; Figure 6). 

Figure 5 illustrates the LC-MS analysis of extracts from wild-type and transgenic 
arabidopsis leaves. Samples equivalent to 10 mg leaf dry weight were separated by reverse 
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phase chromatography and polyprenol elution was monitored at 210 nm using a diode array 
detector (DAD). Elution of standard polyprenols (C45-C60) was indicated in the profile of 
the extract from wild-type arabidopsis. Similarly Figure 6 the LC-MS analysis of the 
molecular ion for dodecaprenol (C60) in rosette leaves of arabidopsis. 
5 In addition to this primary effect, the amounts of other polyprenols (45-, 50-, 55- 

carbon) were drastically reduced (Figure 5) compared to extracts of wild-type plants (which 
contain significant amounts of all of these polyprenols; Table 5, Figure 5). This effect was 
not seen in plants expressing the Hevea Hpt3 or rice clones. The data clearly indicates that 
overexpression of at least two of the genes identified in Examples 2 and 3, which by 
10 homology appear to encode plant c/s-prenyltransferases, dramatically alters the phenotype of 
transgenic plants with regard to polyprenol composition. 

TABLE 5 

Polyprenol profiles of Transgenic Arabidopsis Leaves 



polyprenol 


Wild-type 


35S::Hpt3 


35S::rrl 


35::S11 


35S::Apt3 


C45 

m/z 612-614 


+ 




+ 




+ 


C50 

m/z 680-682 


+ 


+ 


+ 


+ 


+ 


C55 

m/z 748-750 




+ 


+ 


+ 


+ 


C60 

m/z 816-818 


+ 


+ 


+ 


• * * ■ ■ ■ 





The presence of a particular polyprenol in extracts of wild type or transgenic arabidopsis leaves was 
determined by selective ion monitoring of the mass spectrometer output during chromatography of extracts. 
Presence is indicated by a '+* symbol, absence by a *-*. 
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CLAIMS 

What is claimed is: 

1 . An isolated nucleic acid fragment encoding a plant c/Vprenyltransferase protein 
selected from the group consisting of: 
5 (a) an isolated nucleic acid fragment encoding all or a substantial portion of 

the amino acid sequence selected from the group consisting of SEQ ID 
NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO: 10, SEQ 
ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 1 8 and SEQ ID 
NO:20; 

10 (b) an isolated nucleic acid fragment that is substantially similar to an isolated 

nucleic acid fragment encoding all or a substantial portion of the amino 
acid sequence selected from the group consisting of SEQ ID NO:2, SEQ 
ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO: 10, SEQ ID NO: 12, 
SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18 and SEQ ID NO:20; 

15 (c) an isolated nucleic acid fragment encoding a polypeptide, the polypeptide 

having at least 41% identity with the amino acid sequence set forth in SEQ 
ID NO:24; 

(d) an isolated nucleic acid fragment encoding having at least 50% identity 
with nucleic acid sequence as set forth in SEQ ID NO:23; 
20 (e) an isolated nucleic acid molecule that hybridizes with a nucleic acid 

sequence of (a) (b), (c) or (d) under the following hybridization conditions: 
0.1X SSC, 0.1% SDS, 65°C and washed with 0.2X SSC, 0.5% SDS; 

(f) an isolated nucleic acid fragment that hybridizes with a nucleic acid 
sequence selected from the group consisting of SEQ ID NO : 1 , SEQ ID 

25 NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 11 , SEQ 

ID NO: 1 3, SEQ ID NO: 1 5, SEQ ID NO: 1 7 and SEQ ID NO: 19 under the 
following hybridization conditions: 0.1X SSC, 0.1% SDS, 65°C and 
washed with 0.2X SSC, 0.5% SDS ; and 

(g) an isolated nucleic acid fragment that is complementary to (a), (b), (c), (d), 
30 (e) or (f). 

2. The isolated nucleic acid fragment of Claim 1 selected from the group consisting 
of SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO;9, SEQ ID 
NO:l 1, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17 and SEQ ID NO: 19. 

3. A polypeptide encoded by the isolated nucleic acid fragment of Claim 1 . 

35 4. The polypeptide of Claim 3 selected from the group consisting of SEQ ID NO:2, 

SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID 
NO: 14, SEQ ID NO: 16, SEQ ID NO: 18 and SEQ ID NO:20. 
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5. A chimeric gene comprising the isolated nucleic acid fragments of Claim 1 
operably linked to suitable regulatory sequences. 

6. A transformed host cell comprising a host cell and the chimeric gene of Claim 5. 

7. The transformed host cell of Claim 6 wherein the host cell is selected from the 
group consisting of plant cells and microbial cells. 

8. AAost cell according to Claim 7 selected from the group consisting of tobacco 
(Nicotiana §=p.)> tomato (Lycopersicon spp.), potato (Solanum spp.), hemp {Cannabis spp.), 
sunflower Helianthus spp.), sorghum (Sorghum vulgare), wheat (Triticum spp.), maize (Zea 
mays), ri£ (Oryza sativa), rye (Secale cereale), oats (Avena spp.), barley (Hordeum 
vulgarei rapeseed (Brassica spp.), broad bean (Vicia faba), french bean (Phaseolus 
vulgaph other bean species {Vigna spp.), lentil (Lens culinaris), soybean (Glycine max), 
arab upsis (Arabidopsis thaliana), guayule (Parthenium argentatum), cotton (Gossypium 
hii^tum), petunia (Petunia hybrida), flax (Linum usitatissimum) and carrot (Daucus carota 
tfiva). 

9. The transformed host cell of Claim 7 wherein the host cell is selected from the 
group consisting of Aspergillus, Saccharomyces, Pichia, Candida, Hansenula, Bacillus, 
Escherichia, Salmonella and Shigella 

1 0. A method of altering the level of expression of a plant c/s-prenyltransferase 
protein in a host cell comprising: 

(a) transforming a host cell with the chimeric gene of Claim 6 and; 

(b) growing the transformed host cell produced in step (a) under conditions 
that are suitable for expression of the chimeric gene resulting in production 
of altered levels of a plant cw-prenyltransferase protein in the transformed 
host cell relative to expression levels of an untransformed host cell. 

11. A method according to Claim 1 0 wherein the host cell is a plant cell selected 
from the group consisting of tobacco (Nicotiana spp.), tomato (Lycopersicon spp.), potato 
(Solanum spp.), hemp (Cannabis spp.), sunflower (Helianthus spp.), sorghum (Sorghum 
vulgare), wheat (Triticum spp.), maize (Zea mays), rice (Oryza sativa), rye (Secale cereale), 
oats (Avena spp.), barley (Hordeum vulgar e), rapeseed (Brassica spp.), broad bean (Vicia 
faba), french bean (Phaseolus vulgaris), other bean species (Vigna spp.), lentil (Lens 
culinaris), soybean (Glycine max), arabidopsis (Arabidopsis thaliana), guayule (Parthenium 
argentatum), cotton (Gossypium hirsutum), petunia (Petunia hybrida), flax (Linum 
usitatissimum) and carrot (Daucus carota sativa). 

12. A method according to Claim 1 1 wherein the altering the level of expression of a 
plant cw-prenyl transferase protein results in a modulation in the defense mechanism of the 
plant. 
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13. A method of obtaining a nucleic acid fragment encoding all or a substantial 
portion of the amino acid sequence encoding a plant c/s-prenyltransferase protein 
comprising: 

(a) probing a cDNA or genomic library with the nucleic acid fragments of 
Claim 1; 

(b) identifying a DNA clone that hybridizes with the nucleic acid fragments of 
Claim 1 ; and 

(c) sequencing the cDNA or genomic fragment that comprises the clone 
identified in step (b), wherein the sequenced cDNA or genomic fragment 
encodes a plant cw-prenyltransferase protein. 

1 4. A method of obtaining a nucleic acid fragment encoding all or a substantial 
portion of the amino acid sequence encoding a plant c/s-prenyltransferase protein 
comprising: 

(a) synthesizing at least one oligonucleotide primer corresponding to a portion 
of the sequence selected from the group consisting of SEQ ID NO: 1, SEQ 
ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:l 1, 
SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17 and SEQ ID NO: 19; 

(b) amplifying a cDNA insert present in a cloning vector using the 
oligonucleotide primer of step (a); wherein the amplified cDNA insert 
encodes a plant cw-prenyltransferase protein. 

1 5. The product of the method of Claims 1 3 or 14. 



35 



PCT/USOO/25856 



1/24 



Polyprenol biosynthesis 




OPP 



DMAPP 



IPP 




NX 



as- 

prenyltransferases 



Polyprenols 



FIG. 1 

SUBSTITUTE SHEET (RULE 26) 



WO 01/21650 



2/24 



PCTYUS00/25856 



o 



O tr* 
o < 



< 
u 

E-. 
H 
< 
ID 
< 
U 
E- 

< 

Eh 
Eh 

O 
< 

o 
o 



O E-< 
m (J 
(J 
O 

o 
o 
u 
u 

E-" 

E- 
Eh 

U 
O 
Eh 

Eh 



U 

F« 

U 
< 
Eh 
U 
£h 

U 
CJ 
O 
O 
Eh 
Eh 

U 



5 

U 



rH «X> 



HHHHHO|CS(Vj 

O O O O o 

Z2:z;z:;zoooooooo 

O Q O O O 

rHrHrHrHrHOOOQQOQO 
MrHrHrHrHrHrHI— I 

«Q| QI Ql O 1 O 

wuuutoooaooooa 

— — cocoi/icocotococo 

r- o r* oo 

* CL-H T3 O (N X JC CM N CU 1h <P 

m h - ♦ ^ >w a o M 

O H H H ■ tn OD - • 3 M (fl 

ooooor-o<Mr-tm4-> 

-^OOOOrHOrHOO 3 JJ 
CU^^^^M^H^OOO^ tO CO 

o ■ - ■ • a ■ a a a s a <u 

CM O O O U • -H • • . >>» 

W H CJ CM CJ C ^ rH U (J 

£ « i3 J3 J3 o M <H h in 
-6 O XI XZ XI *—i to XI ^ 
<U <D CD 0) M *o TJ 

> s 



i-H 


CO 


m 




co 


m 




rH 


a\ 




in 




rH 


* * 


* * 


■ * 


* ■ « * 


rH 


t— i 


rH 


rH 


rH 




CM 


CM 


* * 


o 


o 


o 


O O 


















o 


z 


2: 


2: 


52 35 


O 


O 


O O 


o 


o 


O 


O 


IS 














z 


z 




z 








Q 


a 


Q 


a a 


















a 


M 


rH 


M 


►H M 


a 


Q 


Q 


Q 


a 


a 


O 


Q 


M 


o a o o o 


















a 


CO 


CO 


CJ 


pa cj 


oaooooao 


CO 


CO 


CO 


CO 


CO CO 


to 


U 


CO 




DO 


w 


[J 


to 


CO 










CO 


CO 


CO 


CO 


CO 


CO 


CO 


CO 




r- 


cn 


o 


r- go 


















r- 


u 


rH 


rH 


rH rH 


CO 


CO 


r- 


cn 


CM 




CM 


rH 


u 


• 




T3 O 


CM 


x: 


x: 


CM 


CM 




-M 


* 


m 
















m 


a 


0) 


M 


in 


o 




rH 


rH rH 


* 


m 


CD 


* 




a 


H 


to 


o 


o, 


o 


O 


o o 


r* 


O 


CM 


rH 


m 


+j 






o 




o 


o 


o o 


rH 


o 


rH 


o 


o 




4J 






a.* 


J* 




rH 


M 


O 


o 


o 


i— i 


CO 


<0 


a 


• 


a 










* 


03 


aj 


* 


o 


• 


* 


* * 


a 


* 


a 


a 


a s 


CJ 


0) 


o 


CN 


o 


o 


a o 


* 


rH 


• 


* 


* 






>i 


CM 


CO 


rH 


CM 


CM CM 




M 


rH 


o 


u 








0] 




« 


J3 




o 


U 


rH 


rH 


LD 












o 


jC 


-c x: 


rH 




V) 


JQ 










i 




aj 






M 






13 


























t> 














Lu 

C1IDCTITMTC CUm /Dill C OCX 



WO 01/21650 



PCT/US00/25856 



3/24 



rH r-i rH rH rH rH 





< 


o 


CJ 




< 




CJ 


u 




fr> 


Eh 




CJ 


Eh 


< 


6h 


Eh 


H 


CJ 




Eh 


CJ 




u 


Eh 


< 




«: 


cj 


a 


H 


cj 




'H 


o 


cj 


cj 


EH 


o 


Eh 


Eh 


o 




O 


Eh 




< 


£h 




CJ 


cj 




Eh 


C5 


CJ 


cj 








< 


CJ 




I 




CJ 


J 




CJ 


\ 


cj 


Eh 






CJ 




a 


O 


j 




Eh 




E-* 


Eh 


j 


cd 


Eh 


1 




Eh 


1 


o 


CJ 










o 


Eh 




£h 


Eh 




< 


Eh 




U 






u 


CJ 




Eh 


Eh 




Eh 


Eh 










CJ 


o 




o 


CJ 




Eh 


Eh 




< 


S 




t 






t 






CJ 






Eh 


CJ 




«: 


< 




iH 


(X) 




O 


CO 




iH 





E-* 
E- 
CJ 
O 
Eh 

Eh 

< 

CJ 

5 



o 
o 



CJ 
Eh 

u 

Eh 
Eh 

Eh 

O 
Eh 

Eh 
Eh 
Eh 
O 
Eh 
O 
CJ 
Eh 
O 
CJ 
U 

< 
CJ 
CJ 

ft 

u 

CJ 

s 

o 

**: 

o 
u 
Eh 
o 
t 
I 



£h 

fH U 

in E-* 

rH U 



n 



O H ID h 

H O CD Eh 
Eh Eh H U 



CD 



CD 



O CD 
Eh u 



< cd 



(J) U 

< Eh 
I I 



Eh 



Eh 



CD CD 
Eh £h 



Eh 
Eh 
O 



CD CD 
CD ID 
O H 
CD 
^ <J 
*i CD 
CD Eh 
CD H 
£h 

tD 

CD 
L) 

Eh 

Eh 

Eh 
CD 



CM K£> iH 

oj tj» n v 

« rH r-i 



H Eh Eh 

H Eh Eh 

< < 

^ Eh 

H Eh 

CD CD U 

CD CD CD 

CD CD C? 

B$i 



Eh 
Eh 




CD < CD 
CD CD CD 
< < H 




U U U 



CD 



o 
o 



VD \D 
CM CM CM 



CM 



HHHHHNNCM 

o o o o 

sisz^OOOOOOOO 
o q a a 

HHHHHHHH 

o o o o 

— — — — cyicocncQcncococo 



o r- co 
a ^ t) o <n x: x: cm cm 

. . . • -h * * M-r 



m h h 

o o o 
o o o 

^ JSef 

a Du ex 



o 

rH 

o 
a) 



o o 

CM CM 



a; 



* -in 

o r* o 

O rH O 

* Oi • 

U * rH 
CM £ M 

A G> U 

X: rH 

a> u 



CO • * 
CM rH m 
rH O O 

o o o 
^ ^ ^ 
a a a. 



in 



x> 
> 



o 

tO 
5 



CO CM rH 

a u u 

a a) m 

d ^ n 

4J 

rH CO 0) 

S 0) 0) 



o 6 o o 

32 IS 21 2 



tn n in 

rH CNJ CM CM 



o 

2 



a a q a o 



o o 
2 2 



a a a a 

CJ CJ CJ CJ 
CO CO CO CO 



U rH 

• a 

in • 
o a» 
o o 

M O 

* a 
u • 

CM O 

CO rH 

e « 

<D 



o 

rH rH 



O O 
O O 

a a 
• * 

u u 

CM CM 
X3 A 
JZ JZ 



H Q Q 

M M 

o 

w a a 

CO W CO 

— CO CO 

GO 

h m oo 

O CM -C 

* *H • 
rH • in 
o r- o 

O rH O 

rH M 

a 

* Qj * 

U ■ rH 
.C rH 

<D U 



o o o o o o 

% % s 2 2 s 

Q O O O Q Q 

M M M M M M 

oaoooo 

CJ CJ CJ CJ CJ u 

CO CO to CO CO CO 



r- 
x: 
- ^ 

ao 

CM rH 

rH O 

O O 



en cm 

CM CM 



«JH 

* 

m 
o 
o 



a a as 
* ■ • 

rH O -O 
H H If) 



(0 CM 

a u 
u 



HP 

u 

CO 

4J 
CO 
0] 

a> 



rH ro in r- <r* ro in 

rH rH 

o o o o o « 

Z 2 S 2 2 O O 

o o o a o 

M M rH M rH Q Q 

| - 1 | | 

ooaao 

CJ CJ CJ CJ CJ o O 

CO CO CO CO CO CJ CJ 

— — — CO CO 



O rH 

• a 

m « 
o o> 
o o 

M O 

* a 
o - 

CM U 

CO rH 

e w 
■a o 



O h 00 
iH rH rH 



ro cd 
CM X 



o o o 
o o o 

M *1 M 

a a a 



o o a 

CM CM CM 

X3 X* 

x: x 

<u o 



in 
o 
o 
M 

a 



XI 

cu 



G M 
O U 

rH 

u 



CNI 
I 

CM 
O 



WO 01/21650 



PCT/US00/25856 



4/24 



CD CD <J H O 

cd tj &t?.m & fa* 

CJ CJ CJ 

u o e- 




< < (J CJ H H 

u ^ cj 



U o 
cj < < 




E-* 

o < 

u cd 

H < (3 ID O 
O (J o < o 

JSEDj'tSi Ei 33E(t!55 

< CJ U H rfT£H 

< < u 2 < 

r^o MM 

U H U H 



£h 
Eh 
CD 




I 

C5 CD 



E- 
H 

o 

Eh 
< 



C5 
Eh 

< 
CJ 
E-> 
Eh 



u cd 

CD £h 

CD U 

< E- 



o 

m 



rrji 



U O C!) O 
O C) u u 

r~* E*^ E-* 



cj 


cj 


o 




«t 


*c 




o 


CD 




CD 


CD 




Fh 





eaoaeaoaisiBieaEaBisi 

o u»i raji&i 

V> c5 tewlHlo 





O ft h H H U O 



u u o u 
u- t-r o ■« O 




(j o o 

CJ o u 

E- 1 E-< H 

o o u 

< < 

CJ CJ U 

u o a 

cj cj cj 




u u u 

E**^ 

< < < 
o u o 

ID O tD 




u o u o 

(J C_>" O t;) 



cd o 

CJ H H 
823 < 
U O 
CD < 

e- cd 
< cj o 

£h CD < 

O CD CI> 

CJ O O 

Eh U U 

CJ U Eh 




O EMEU 




1 1 O fE5l CD 




OlO CJ CD O O O O O O U CD 



Eh O Eh £h 

CJ L) U CD 
SIS! < B33Ct£H 

h o < 





(J <J o 
u u < 

!G31 Eh pDi 

u to u 

U U CJ 

J833 CD CD 
CJ u u 

cj tenia 

fiC CD CD 





Eh EhO CD CD 
rgQESOHSlCD CD 

u u 



O eh a 

CD C5 O 
SK01C33K31 

CD CD CD 
i5_CD CD 
CD CD 
O 




CD < CD 



(J I U U O O U U U (J U- U O L> 
o a CD c? w o- t5- O O <J 



O O E-* H H H "H O U U U U 




CT» (Ti O H H CO 
OA fQ v CNi (M 



" — " H W (M ^ — 





1—1 






ID 




rH 






cn 


ro 


m 


r- rH 




ro 


m 




iH 




rH 


CM 




CM 










rH 


iH 


rH rH 


tH 


CM 


CM 


CM 














O 


o o 


O 


O 














O 


o 


O 


o 


O 


O 








2: 


O 


o 


O O 


O 


O 


O 


o 








z 


2: 
















2: ^ 








Z 














o 


Q O 


o 


o 
















Q 


a 


a 


a 


a 


a 


rH 


rH JH 


M 


IH 


o 


a 


Q Q 


Q 


Q 


a 


Q 




tH 


M 


M 


rH 


M 




































o o o o 


a 
















a o o a a o 


pa 


pa pa 


u 


pa 


ooaoaoaai 




w 






W 


u 


CO 


CO co 


CO 


CO 




Ed 


u pa 


pa 


pa 


pa 




CO 




CO 


CO 


CO 


CO 










CO 


CO 


co co 


co 


CO 


CO 


CO 
















CTl O 


r- 


CD 


















en 


Csl 


to 


CM 


rH 


o 


r-t rH 


rH 


rH 




GO 


co 


CM 


CO 


CM 


rH 




Csl 


CM 


a 




hP 


* 






o 


CN 


x: 


.C eg 


CM 


a u 


4-» 


• 






a 


a> 


M 


lO 


* • 


* 


* 




* 






a 


0) 


M 


CD 


• 


• 


3 




to 


o 


OA rH 


rH 


tH 




in 


CD - 


■ 


3 




CO 


CM 


rH 


m 


4J 






o 


O O 


O 


O 




o 


<M rH 


in 


4J 






rH 


O 


o 


P 








O O 


O 


o 


fH 


o 


«H O 


o 


3 


h»J 




O 


O 


o 


*H 


CO 


CO 






a; 


rH 




o o 


o 




CO 


eo 




J* 


^± 


* 


tT5 


CO 


■ 


<X Cu Cu 


a 




CUM M 




• 


<0 




a 


a a s 


0) 


Q) 


o 


• * 


• 


* 


a 


* 


a a 


a s 


0) 




• 


■ 


• 




>i 


>i 


CM 


u o 


o 


o 




rH 


■ ■ 


* 




>, 






u 


o 








CO 


rH CM 


CM 


CM 


c 


U 


rH G 


u 








rH 


tH 










i 


co 


JQ 




o 


M 


rH rH 


tn 








CO 














o x: 




x: 


tH 




« -Q 


M 










•a 












0) <u 


<D 




M 




T3 


T3 










> 






















> 


2 









tHcOinr^oArninr^rHOAcotnc^ 

•* rH rH rH rH rH CM CM CM 

o o o o o 

S;ZZ22:00000000 
o a a o a 

MrHrHrHIHQQQQQOQCi 
H H H H M >H M H 

o o o o o 

uuwticiioiaooiotoaa 
cococococopampac^papapapa 

p* o\ o r* CD 

OcHrHrHrHrOOOr^COCM CO CM H 

• a-H t) o cm x: x: cm cm a u u 

iO * * * **H « • ^ *M Qj <D M 

O OA rH rH rH • tQ OO • - 3MC0 

.ooooor-ocMiHin^j 

^OOOOtHOrHOO p p 
a^^^XH^OOOH CO CO 

: :d.'0*c5H,as<D<u 

CMOOOO * rH * • . >i>t 
CO rH CM CM CM C M rH CJ O 

B u h h in 

TJ O £ i: H (0 XI ^ 

a) a> a) a> u t) t) 

> 5 



ro 

CM 




CMDCTITIITC CUCCT /Dill ET OC\ 



WO 01/21650 



PCT/US00/25856 



5/24 



o _ 

U O O .„ 

o o u o 
[§J^E^1[^ o o u o 




o o o o e> 

H H 

o a 



o < u u 

O O 




a o u 

O O O BSfSSESSl 
^ (J u < o u 
o o o 
o o 



u 

O O O < E-« 
(JOCiU 

o 





o 

h ^ii^e^^i^aSiiiifiss^ "SSI o < h 

p£ CD H £^ H H H O O CD U H O 

CJrfHHHOOtflOCJ^H 
O 2 H H H H H. o < o a o 
ril < h H h U U 0 O O O h _ 
O O < tKtEaO&SEpfEpS O 

0<£-<E-«HHHOU <T < H O 
OUC9HOO*(H 



O 
CJ 



»K1 



H HiST O O O "3T O rtf _S_ H 



*h H 
m 



H H 



H 

S3 
H H 



KB 



to o 



a u o h 



^ooc^i^r^oocM^rrHr^fOr-r^ 



o 
in 



H^OOOrfJtflOHrtHrfJrt: 
O O H E- E-v H H U O CJ CD U E-» 
CD CD OdESEErjiEtJfErS U U O FE5Jf5fl 




m 

H (J CJ O H O 



< h o u u o 
" ' IBjjl cd o cj cd 

■-ooo 



< < ^ ^ < 

'rf «r ^ 



3f I ^ i < i i ^ 




E~* U C^ 1 

fSlGQBfll 



OHO OHO 
H CD H H O O 
H < O CD < O 

Ofeao o 
o o o o o 

O CD EEfl£E5i 
H O 

O CD __ __ 

o o 

a o 

H O CD CD CD CD 






O H H O O CD O 

o o o o e> o o 





H KQf^BlK^ H 
O O < H H 
pa£3fE3 O 
O O O H O O 

ouora 

^* ^ ^ U _CL 
O O O tEB < 
^^^O O H 

" O O H figa H 



o ley o 
^ o ISI 




H ^ ^» 

MHCM(M(SjHCMmfnCNHCS|rO 




E^O O 
H H «C 
H H O 

o 




H #i ^ 

ICB^JO 

<_ O H 




io Bp I 



CT> r-t ^ 

*a* oo r~* 
n h <m 





m to 


r- 


cn 


CO 


in 






cn 


ro 


m 












r-l 


rH 


rH 


rH 


rH 


CM 


CM 


CM 


o 


O O 


O 


O 


















z 








O 


O 


O 


o 


o 


O 


O 


O 














21 




z 




Z 


Z 


a Q D Q 


a 




















M l-t 


M 


M 


o 


o 


O 


Q 


Q 


Q 


Q 


a 


aaoa 


ex 


























CO 


CO CO 


CO 


CO 


w 




Da 




CJ 


W 


b3 


us 










CO 


CO 


CO 


CO 


CO 


CO 


CO 


CO 




m o 


r- 


CO 


















o 






i-H 


CO 


CD 


r- 


m 


CM 


Of) 


CM 


rH 






■a 


o 


CM 






CM 


a 


M 


-P 


m 


* * 




* 




* 


• 






a 


0} 


M 


o 






rH 


* 


in 


00 


* 


+ 


a 


M 


ca 


o 


o o 


o 


O 




o 


CM 


rH 


m 










o o 


o 


O 




o 


i— 1 


O 


o 




■P 


-P 






M 


f-t 


J* 


O 


o 


o 




CO 


CQ 


• 




a 




a m 




M 


* 




(0 


O 


» • 


« 


* 


a 


« 


a a 


a s 


a) 


<D 


CM 


u u 


u 


u 


. * 




* 


* 


* 








03 




eg 


CsJ 


c 






u 


o 














-Q 


o 




rH 


rH 


in 








% 








*H 






€ 


M 












V 












TJ 






















> 











• *• »HHHHHMCM(M 

O O O O o 

ZZZZZOOOOOOOO 

Q Q Q Q Q 

MMrHMMQQQQOQQO 
MMrHrHrHrHrHM 

o o o o o 

cococococowwcjcduwpau 
— — — — cocococococococo 



r- cn o 

O rl H 

• CU*H 

IT) • * 

O C\ rH 

O O O 

-V o o 

a, *m 

• cl a 
o • • 

CM O O 

CO rH CM 

a) a> 



rH rH 
TJ O 



o o 

o o 

a a 

* m 

6 a 

CM CM 

<u a> 



ro oo 

cm x; 

• in 

r- o 

rH O 

rH <X 

* a 
a 



r- m cm 

jZ cm cm 

co • 

CM rH 

rH O 

O O 



m 
o 
o 
J* 



ffl CM 



• a a as 



o 



u 



rH O 
•H rH 
« XI 



o 

X) 



CO 



u 

CQ 

4-> 
CO 
CO 
CD 



1— I 




m 


o 


O 


O 




z 


Z 


ID 


ID 


ID 


a 


o a 






Da 


CO 


CO 


CO 


r- 




o 


O 


rH 


rH 


« 


a 




m 




* 


o 


0\ 


rH 


o 


O 


O 




O 


O 








• 


a 


a 


o 


» 


« 


CM 


o 


u 


CO 


rH 


CVJ 






X) 


i 


o 






o 





I 

CM 




WO 01/21650 



PCT/US00/25856 



6/24 





C3_fa=Ci£-» 
IBS** H 
E-» tr* 

e> ^- <d 



Eh U 
CD C3 O 

o utsi 
u <j> ^» 
o 



o 
m 



Eh i 



» 

i i i 

o o i i r i 
H Eh O Eh 
Hp EE S3 C) 

cd cd cd 




ffif ^ iSJ 




U O H 

Eh Eh 



HUH£h 
CD_ CD O O 
O CD O 




U O H h e U H 
[?82J[EaB^33£g eh 333 

CD U ^rfTh Eh O Eh 

= CD 

O U ^ 

*£5?fggcD 



K£>ftt!D*tC9at 
U U ft < 

h h o u 
uuuu 
u u o 

Eh 




I I h H H h H 

I I < < O CD CD 

i | fn h h h 

i I I 1 i 

cd < < < 

SESBEa Eh &h em E5£@a 
cd H (asstgEBg cd 
^ o u o raaiaa 

U < 

H h h Q_ O O CD C2 

1<J> CD CDffitQO Eh 

t Eh H " 

^ [IBISB1 

< CD CD fn H H U 
iSB CD *C 

I 
I 
I 

i 
I 
I 

u 




CD CD CD 
Eh O U 

8285333053 
rtj CD 

u u o 

OS 

£h 

U Eh 




^^rHCMiniOHCDrHD 
CMCMrHPl^^CMCMCMrO 





a* ro 


m 


rH 




m 


in 




• * 




rH rH 


iH 


rH 


CM 


CM 


CM 


o 
















ZOOO 


O 


O 


o 


O 


O 






% 2: 




Z 




Z 


Z 


a 


a 














M 


rH Q 


Q O 


o 


o 


Q 


o 


a 




l-H 




rH 


M 


rH 


rH 


w 


a 


a 














u 


uooaaoooo 


CO 


CO ca 


to UJ 


w 




W 


W 


w 




« CO 


CO CO 


CO 


CO 


CO 


CO 


CO 


r- 


co 














rH 


f-n n 


co r- 


en 


CM 


to 


CM 


rH 


TJ 


O CM 


x: x: 


CM 


CM 


a 


JH 


4J 


* 


* -H 


* • 




HH 


cu 


a 


M 


rH 


rH ■ 


lo oo 


• 


* 


a 




CO 


O 


o r- 


O CM 


rH 


in 


■p 






o 


o rH 


o <-i 


O 


o 






XJ 


J* 


J* rH 


X o 


O 


o 




CO 


to 


a 




CUM 


M 




• 


<a 




■ 


• a 


* a 


a 


a s 






o 


o * 


rH ■ 


• 


* 








CM 


CM c 


I* rH 


u 


o 










XI o 


M r-H 


rH 


LO 








x: 


JC rH 


CQ 




J* 










o u 




TJ 


TJ 














£> 


3 









h n m ^ n 

OOOOO- 
& 2 % Z 2 O 

OOOOO 

H H H H H Q 

a a a a a M 

CO CO CO CO CO u 



lo rH ro lo 

■H rH rH rH CM CM CM 

6 6 6 6 6 6 6 

z z z z z z z 

0000000 

H M rH rH rH M M 

0 0 0 0*000 
tJUWW WW w 
to CO CO CO CO CO CO 



r— cn o 

UHH 

in ■ • 

O <T\ H 
OOO 

-V o o 

* a a 
o • * 

CM u u 

(0 rH CM 

•a o ,c 



rH rH 
TJ O 



o o 
o o 

a a 
• • 

0 o 

CM CM 



CO CD r- 

cm x: x: 

•H • * 
* LO CD 

r- o cm 

rH O 1H 
rH ^ O 

a * cu 



0) 



J3 



c 
o 



M rH 
M rH 
(0 



CO CM 
CM CM 

• • 

rH m 

o o 
o o 
>; 

a a 
« * 

u o 

rH IT) 



10 CM 

a u 
a <u 

4-1 

rH tO 

• nj 



4J 
M 
co 

CO 

m 




CD CD CD O CD 
CD CD O 

CD CD E-r 
Eh 



ESI 
O (J 
O 

o u u 
poo 

H H H 

e) o < 






u o 
< < o 




U U U 
WOE- 




o 

O h h h 



„g^o o o o 
000 



CD CD 



CD < Eh Eh 
CD OIE3aES3_ 

ueiooo 

H .^_U U O 
Eh 

{0 

CJ CD CD C3 CD 

ILL" " 

^ « Eh £h CD 

lo 1 1 f 1 1 





CD CD Eh 
1 1 

CD CD SC. , 

Eh H O I 
ib^IHtJ^ CD _ 
U U U U (J 
fr* H U 

0 o 

1 1 



s 



1 r 1 



mr^cncJicrir^oorHOr^ 
iHfOLor-cr»roLor^iHcn 

f— l rH 1— I rH rH 

00000 

zzzzzooooo 

rHMMMrHOOOOQ 
^ ^ _ rHMMMM 

OOOOO 

WUUU3WOOOOO 
COCOCOCOCOtJCJCxJUlW 

w " w " w COCOCOCOCO 

r^* cv o r*- 00 

* D*-H *0 O CM X CM CM 

^ h • * j«; m 

O <Ji rH i-H rH * LO OD • • 
OOOOOr-OCMrHLO 
-X O OOOrHOrHOO 
CX^i ^^-SirH^OOO 

- d< a a ^ 

CJ • • ♦ . CL, 4 CL q. Ch 

CMOUUU . rH . * . 
OTrHCMCMCMC^rHUO 

g W XI XI pQ O ^HHU) 

TJ u x: x: x rH coxj-v 

<U <U <U 0) M TJ *0 

> 2 



m 
I 

CM 




QMRCTITIITP CMPCT /Dill C OC\ 



WO 01/21650 



7/24 



PCT/US00/25856 



fn CD 

h tea 
< o 
o mm 

<■ u 

E-< Eh 

< m 

U CD 

s=cir ""■ 



< h H 

< u o o 




U h H 
Eh ^ H 

CD £h H 

H o <c 

!ESEH1 eC 

<T cj u 

Eh O O 
CJ O Eh 



C3 CD 

L0 CD CD : 




*T IT) ID 
Csl ^ 
rO CO 



ro m r- 

CM CM CM 

606 
2 2 2; 

Q Q Q 



a a a 

w CJ w 

CO CO CO 



CO CM 



Sh 
CO 

to 
a> 




Fn fn 

h e? o 

lego u 
iD m 
m& 

iD iD 
O iD iD 

h o a 



< *< u u u 
<J (J CD *C < < 
U < O <U CJ 



o < o o 

rfCDUU 
E8IS3CD CD 
CD H E-« 



fkjf 

<CD CD 

O H E-t 

u u o 

CD CD CD 

H Eh 



< 



H e-« H L> 

icd o o tea 

U C5 O UCJ 

£h H H H 

<C < < 

H H CD CD 

CD CD 



H H « C5 C5 
«C »=C 3 < < 
CD CD CD CD CD 
4 »a; < Eh Eh 
<J O O O <_> 
""IBS 




U U U < 
CJ O H H E-* 

lE&sgaiaau cd 




HHHHHCMCVICN) 

OOOOO T 

2S2;S^2:0O0OOOOO 
Q Q Q Q O 

rHMrHrHrHrHrHM 

wiouaaaaoaoo 

crv o cd 
OHHHHroajt^ncN tn CM H 

to. - * • - . >; *h a a> m 

O d H H H » IT) 00 • • 3 14 CO 

ooooor^ocN*-*tn -M 

^OOOOrHOrHOO O-M-M 
a^X^>!H^OOOH.tfl to 

a - * • * a * a Qi as <v o 

CM O O O O • rH • • • >i >i 
W H CM CM OJ C M rH O U 

£ « H J3 XI O M H H in 

-6 u x^^h w -9 

QJ <D QJ 0) Jn X) T3 

> X 




CJ 
SBEh P 
CD CD 




IsESKS C9 



Eh O CD CD CD CD 






< 








CD 


CD 


CD 










< 


< 


< 




< 




CD" 




ID 


iD 


CD 


CD 


CD 



!i£H5 
U U 
CD CD CD Eh H Eh 
Eh Eh E™^ 

u u_ u c_> o 
lu u 

O OOOO 
H H H H H 
II I u u 
J I 

u o ssiu a 






in 



&H iOTH 

H O U (J 

U U lEBiEaiESieSJEBiHa 

U O O 13 O O O U<U 

lire 
E-< Eh 

SSHDSHCD CD 
CD CD {E3ES31 _ . ^ 
H H O O U H < < 
Eh Eh 5GH O H H 

B3G0IB1 

Eh CD CD CD 1 I »5. CD Eh O CD Eh 





ai us* 



u u o u 




ro ^ ^) vd r- h n in cm 
iHrninr^<TifOinr-fHO\rninr- 

HHHHH(MN(N 

OOOOO 

SZSSZOOOOOOOO 

OOOOO 

MMMMMOOOOOOOQ 
l-HMMHWMI-ll-H 

OOOOO 

uuwcdcijoaoooooo 

07COCOCOCOU3UWWWWUW 
wwow^o)cOCOW CO CO CO CO 



- a 

vn * 

O <Tv 
O O 

o 

• a 

o 

CM 
CO *H 
g CO 



o r- 

rH i-H 



o o 
o o 

a a 
• • 

o o 

CM CM 



00 

H O CO fO CM W CM 

O'CM £ r N CM Or M 

■p 'H • * ^ *M O. <U 

rH * 1/) 00 • • P U 

o cm t-i in +-» 

O rH O O ^1 

O O O rH 

Cu ^ ^ 



4J 
M 
CO 



o r- 

O rH 

M rH 

• a 



a o* o- s 



o 

CM 
QJ 



* rH 



CO CO 

03 IT3 

0) 0) 

>1 >1 



c 
o 



to 



o o 

rH in 
> 3 



CO 
I 

CM 




WO 01/21650 



PCT/US00/25856 



8/24 



O 



U O CJ Eh H 

*c <c < 

< < < I I 

H H CJ t I 

CD CD CD < i 

H H H Eh H 

H £-« £h < < 

H E-« E-» 

cd cd cd 

Si rfi 

CD CD CD 

H JH Jh 

cj o cj 

o u u 

Eh H H 

e> cd cd cd cd 

H H E-* I 

E-t Eh E-i 



CD CD 

CD O 

CP CD 

< < 

CD O 

i 
i 
» 







CD CD 
H H H h H 

O O O 
H H R CD CD 

£h C-h 

y o o *5T *c 
o cj 



CD U CD 

~ CD 1 
B28 

Eh 



o o o u o 

H H H O CD 



o 
o 

CO 



CJ CJ CJ O 



U 
J 
t 
I 
i 
I 
I 
I 
t 




CD CD 

S S a 

o u u 

H. E- CJ 

uuu 

O CJ CD CD CD 
O CD CD J I 
U I I 

I I 

II _ , 

. , . B33IBB!Ea U O^OISSI 

CD CD CJ U CJ CJ CJ CD CD < CD L> CD 

o o o 

!u 

iisai 

CD CD " CD , _ _ 

CD O CJ CD 

_ 09I< SE3!1 

< CD CD CD H CJ 
CD CD {^Bs"SI3n ^ 

u&mtaWawm cd 

O CD CD CJ O 
rt! CD CD CD 

*a! < < h e- 
u u u u u 





CD CD 

tearatEaiBaiEffl cd cdibs 





m 





CD 








U 








s 




H 












O 








cd 












5 




C2 



Bp 

U ft CD CD CD 
< Eh CD CD CD 

I Eh Eh 
I H Eh Eh 





CD CD 

< ft O CJ CJ 
CD CD Eh Eh £H 

< CD CD CD CD 
CJ U CD CD 




U Eh o CD CD 

I I CD CD CD 

I 1 Eh Eh E-h 

I I O O I 

CD O < < I 

I I Eh Eh | 

I I CD CD CD 



CO I 



< *t < 



HHHiHHMCNJCM 

O O O O o 

ElSESZZOOOOOOOO 

a q o o o 

a a a a a _ 
w cdww oc*aaaaaa 



O iH 

• a 

in « 

O <T\ 

o o 
^ o 
a ^ 

* a 
o - 

CM 

TJ 



O l> CD 
H H H 
-I'D O 



n cm 

CM CM 



ro co r- 

CM X! XI 
, * » -r»( • * 

H H H • lO CD * • 

ooor-ocsitwm 

OOO^HOiHOO 

a a a-* cum m m • 
. . . * * a a a £ 
u u u o * h » * 

H CM CM N C ^ H O O 

cn X5 -Q XI o M h h m 

OJ <t> Q> 0J )^ TJ t3 

> S 



a n 
a a> 

d jj 
* <a 
>1 



u 

CO 
4J 

cn 
rd 



HHHHHCMCNOJ 

O O O O o 

2222:300000000 

a a o o a 

MMhHI-HHHClOOOOOQO 

i " i n ^h n n 

cacao 

to u (J [J [J aaaaaaac 

— cococococococo.co 
r- o co 

* ft-H o csj x cn n o*h-m 
ir> • • - -*h • ' ^ a <d ^ 

O C7^ H H H •LOOD • *PUtQ 

ooooor^ocNirHin-u 

J^OOOOrHOf-HOO 3 iJ 4J 

o - * * • a * cua as o> a) 

cm o o o o . r-| • . • >t >i 

g ca X) o V4 h h m 

O 0) fl) dJ M T3 TJ 

> 5 



VD 00 tf> iO CD 
H ^ H 

VD in in 



h ro m c\ 

6 6 6 6 6 

2 2 ^ 2i 2 

o a q a a 



a c o c a 
w u to w u 

W CO CO t/) CO 



cTi o 

O H H 

- ft. -H 

in • * 

o cr» h 

o o o 

o o 

a ^ 

• Cm a 

o * • 

CM 
CO 

T3 



o 

CM 

x: 



r- oo 

TJ O 



o o 
o o 
J* ^ 

a a 

• • 

o o 

CM CM 

XI -Q 

x; x: 



i 

CsJ 




CI ICCTITI ITf= CUCCT /Dl II C OC\ 



WO 01/21650 



PCT/US00/25856 



9/24 



fn Eh 

U O U 
< < 

u o 

0 o 

£- £h 
(JO , 

e-< t 

1 I I 







ID ID 

U H 

o m cd o 

CD (D < H «3 
U E" CD U 

H fC H U 

0 V u < o 

U 13 H 13 U 

id jcsusa < < 

1 t I u o 

u o 

E-« ._.„. 
tD U 

U H KfflES 

_o ID a 

CD < H E-« 



H £-» tD CD (D CD 



o o o 

Eh <J U H 




Eh Eh 
Eh H 




H too 
a iD o fir « 





Si 



U U U H H H 

o a is 



o o o 
u o 

aSu u 
o cd 

Ei Ei 
E- E-« 

U h H 
(J o 

o id 
o u u 

U (J o 



2-35131 CD 



U fESIfESICD 







o o 

CD CD CD <J CJ 

e- o u 

u c_> o 

CD (D I I 

ft < t I 

_ H O 1 l 

H H b O O 



t 

cd 

£-* 
E-« 

< 

H tC*3i CD KsH 

< cd e> cd 

U h U H 

Eh tSli 



o u ogs 

< O rt! U U 

o o o id cd 

cd j&teteBss 

fM^&SKH 

O U U H H H 
CJ < tf u 

irao cd 
sa i t 

1 1 t 
I I 1 






H H H O (!) 




ttHjtfc^afEH 

wsita»tt£B E-i Eh 




Eh Eh O U U Eh 




O 

_ _ __ O O 

u u u cj u _gf 5T 5T o 

U U L) S33f&IiSl!££! <J) g 



in 



O Eh Eh 




S£ &f 

r4 Ra &k f& Mif^^h^ 



OOeJOOEHHOOOEH 



CD 


CD. 






CJ) 





OOOUUOOCDOOO 
!FE3[StE30I < < rij 




i3? 



u o 




H h H U U 



si 




CM^rPOnHOHCONOOm 



mLnr-rHOOOmr- 

HHHHHW(\JM 

ooodoooo 

0*0000000 



fO CD n <N 

CN ^ £ M OJ 

-H - * ^ *M 

• iO 00 * • 

r- o N h in 

i-l O iH O O 

>; o o o 
^ ex ^ 
Q, * Q* a* 



01 CM 

a u 
a <» 
p u 

p 



4J 
CO 

CD 



01 

03 

>i >i 



o 



^ h h in 
> 3 



HHHHHCJ(MN 

§S§SSooooggoo 

O O Q Q Q 

MMMMMHHM 
CO CO CO CO C/3WWWWWUUW 



<ri o oq 

o h h h h n CO 

* OU -H "O O CM ,C 
lO • • • • -H • 

otnHHH * in 

o o o o o o 

O O O O tH o 

a ^ ^ ih ^; 

* a a o* & ^ a 



o 

CM 



a o 

rH CM 

to X* 
O £ 
0) CJ 



o o 

CM <M 

x: x: 
a) a; 



a 
* 

o 



ro cm 
cm cm 

CD * • 
CM H m 
r-t O O 

o o o 
^ M M 
a a a 



01 CM 

a n 
a o 



4J 
CO 
-P 
03 



cn 



u 
u 



o o 
n ^ 

> 3 



iHmmr-onmr^i-Hcriro 

«H rH i— f t— i t-H cm 

O O O O O 

z%z^z;oooooo 
z z 2; is 2; % 

QQQQQ 

HHMhHhHH-*oooaon 
o o o o o 

tdwwwwooaoao 
— — — ^-cocotowcow 



a\ o 

U H H 

lT) . • 
O H 

o o o 

J4 O O 

q, >; ^ 

u • * 
CM O O 
0) H CM 



co 

rH rH CO 
TJ O CM 



CO CM 
CM CM 



o o 

O O r-i 
M M t~i 

a a 
* * tx 
o o 

CM CM 

XI J3 

^ x: 



o 



m co 
o cm 

O rH 

o 

• a 

iH - 

H rH 

to 



CO 

a 

D 



rH lO 

o o 

O O *H 

J* ^ * 

a as 



u 

rH 

TJ 
> 



o 
m 

TJ 



i 

CM 



WO 01/21650 



PCT/US00/25856 



10/24 



u 





c^ o 





Ih HI 



Irssisb « RfSEa ^teh 

o o o o o < 3 cs u 

§ ^ TJ u~ TT cT o 
^i^rHJlfeJ^I 





fix rS^ ffa^ 



o o *t 

CD H H 



Eh 

o 




U ol^DE3iBS3U u 

< <J O O CD C9 CD 

^ h h h t^BBsagta h 

S^r^rtriwai ^. _ 




[tea o u o oo 



m u H h 
o < 

rH (J) 




E- 1 



u u o u u 
< *=c cd < 



o cd u o cd 



< 



5GSffiD3 Eh Eh 

cd eh 

< CD UU'U-U 
Eh Eh Eh 

o res (J u <J 



*c < u o 

CD U Eh O 



u 

Eh 




Eh 



Eh 



cd 

Eh 



O U O CD CD 
H H o u u 




car 

ten 

< Eh U 

o o o 

Eh Eh CD 
Eh Eh CD 

o u SKiaalia 





cd cd cd cd cd 



CD H 



CD CD 




ti^lE^fofik^tHi CD CD U O J63fE§| 

DS5S O i U 

cd o 

H S O 
Eh 
CD 




CD Eh 



»H CD CD 



CD CD CD 
BEBlESiaSlI 

CD O Eh U U _ , 

OO U O O CD CD O _ 
< ril CD U U h H U rfO O 

cd cd cd ^sctianias u c&raarSh 

00<«*^CDCDC5CDEh^ 
EHtnOUUCDCDOrtEHCD 

H h eh h eh gxa^iiSE^fisai&a 

U CJ U Eh Eh dJrtT CD CD CD CD 



CD 



o 
o 




CO 



cm 

■ ■ • * 

O O 



Ed Id 
CO CO 



csi-^mmi-HOrHoocNjcDmcor- 

HHHHH(N(MCN 

o o o o o 

rszazssoooooooo 

QQOQQ 

O O O O O 

— — — cocococococococo 



Cs| ro m H O H CO CN CO to CD i— I 
HHHHHCMOiOJ 

O O O O o 

zz^z^oooooooo 

Q Q Q Q Q 

o a a a a 

— ^ w w to CO w w w w to 



GO 



o 



o 

CO 



CM 
M 
(1) 
U 



AJ 
U 

in 



ro 

a> cd 

>1 >! 



r- o 

m - * 

O CTi i-H 

o o o 

j-; o o 

* a 

o • ■ 

CM O O 

tO *H CM 

-dux: 



oo 

iH ih co oo n n 
"0 o cm .c ja <n cm 

H H • Lf) 00 • * 

o o r* o cm rH in 

O O rH O iH o o 
^ H^OOO 

a, a. o. m m m 



o u 

CM CM 



x: 

0) 



c 
o 



a a a s 



tn cm 

a m 

tn 
a> 



to 



o o 
i-i in 

> 2 



CO 

to 

A3 

>1 



os 

• a 

in * 
o c\ 
o o 
o 

a-^ 

* a 
o • 
cm a 

CO rH 

£ to 

-o a 

0) 



rH rH 



H CI CD 
O CM 



O O 

o o 

a; a; 

a a 

o o 

CM CM 

ja n 

x: sz 



i— i -in 
o r- o 

O rH o 
M i-i M 

a-^ a 
* & * 

O • rH 

CM C U 

A O U 

JC rH 

0) u 



r- ro cm 

X cm CM 

00 * * 

CM rH m 

rH O O 

O O O 

J£ M M 

a Qi a 



CO CM 

a u 

Or 0J 



to 



o a 

rH m 

> 3 



CO 

a) 



4J 
M 
CO 

CO 



r- 
u 
* 

to 
o 
o 

a 
* 

o 

CM 
(0 



CO 

I 

CM 




CI 1QCTITI ITP QUCCT /DI II C OC\ 



WO 01/21650 



PCT/US00/25856 



11/24 




S 

o 

O 

< 

u 

u U 

H 

KB! 

i 

o o u 

(J fcCJ 



O H 



^cocooor^oo<ri<rir^r oo <r> 



HHHHH(V|OJOJ 

oooo "i: 

^22?:oooooogg 

O O G O 

OOOO 

— — towcocncneocnto 



cn o 

f-H tH t-l 

QU *H "0 

* • » 

cn H H 

o o o 

o o o 

^ M M 

a a a 

* • * 

u o u 

H CM N 

o s: 

a) a> a) 



oo 

i—l CO CO 

O oj X 

t-c • m 

o r- o 

O «-H O 

^ rH 

a a 

u • <-» 

eg d M 

x: h 



ro <m 

£ CNJ CNJ 
CO * 

(sj h in 

iH O O 

000 
^ M M 

Cu Cu Cu 



« CM H 

a u 4J 

3 u w 

4J 

H 10 (0 

S QJ <l> 

>1 >! 



co 



o 

> 



O 

tn 



CM 

m 



i 

< 

o 
< 
< 

H 
tD 
O 
U 

£h 

< 

E-< 
E-< 
E-t 

O 
O 



lOcDf^r-criifior^noin'fiO 

00 CD 00 00 1 — CT| CT> CT) CO I CO o 

* -* ' " " ' — ' * ' ' ~" — " " ^" * ~ 1 "* " 1 ^ 

HHHHHOJCMM 

§§§§§06000000 

00000 

MMW i_iMoaooooaa 
a o o a o 

wwwwwao»oo<aoio<q 

— — — — {/} (/} CO W CO ^ coco 
cn o a> 

♦ O. T3 O CJ £ X N (M. (X^4J 

to h - . *w a a> n 

O H H rl < m OO * ■ D M W 

ooooor^-oojr-iin^-) 

-i-iOOOOi-HOi-HOO 
aJ«!>i^^H^OOOH (0 CO 

* aaa a-* • £ * 
o • • • * a • a a as o a) 

CM o o o o • • • • >>i 
(OHNrgcMC^HUO 

e tn X) XJ o HHHin 
ujs co ^ ^ 

lU OJ O (U U T> T) 

> 3 




WO 01/21650 



PCT/US00/25856 



12/24 



o 
in 



> 
>> 
J 
iD 
CC 
X 
n: 
z 

CO 

to 
cu 

>< 

o 



>< 
.J 

2 
2 
i 



2 Z ^- 
>< 



2 2 2 



< I 


*4 




2 


fcH 1 


to 


CO 


CO 


o; i. 


Q 


> 


hi 


j j 


E-* 


rH 


CU 


^ i 


Z 


hi 




X 1 


CJ 


CO 




Cm ) 


H 


a 


1 


a: j 






j 




CO 


o 


1 


> 1 


Q 


CO 


l 


co i 


> 


a: 


1 


t-H I 


CO 




l 


a. | 


cu 




i 






Cm 


■ 


tM I 




Put 


■ 


CO 1 


CO 








# 


w 




rn i 

^* * ■ 






■ 


O 1 


CO 


a; 




M | 


CD 






CC 1 


hi 


Pi 


j 






S-* 




V J % 






I 


ai i 


1 






Ui | 


1 








1 




■ 




1 






CO 1 






} 


CO 1 


j 


CO 












t 


! 




[ 


t/i I 








^ 1 




CO 












H 1 




CO 




i 




cu 




ei i 




cu 




J 




Eh 




z i 








a i 




> 




< i 




hi 




CO 1 




CU 




rH 1 

CU I 




M 

Cu 




Cm 1 




hi 




Pi t 




Oi 




Cm t 








CO I 




CO 




1^ 1 




Cm 




S I 




2 




rH i— i 




rH 


rH 




*1 
w 



cu 



CU CU Oj 

H H h OS 

H H H (j 

CO to CO Q 





i >> 








i ^ 






oc: 


1 O 


u 




w 


1 M 






w 


i a: 


es 






l 2 


2 


2 


to 


1 >• 


>- 


>-< 


2: 


1 


i< 




M 


t e) 


w 


o 


O 


i j 








i *i 




M 




1 PS 


a: 






1 (m 


u. 








a 


1 CO 


CO 


CO 


co 


1 cu 


cu 


CU 


d 


1 OS 


CC 


Q£ 


55 


1 CO 


to 


O 


rH 




r- 




CO 









Cu 



o 



OCUOU^KOCOQQ 

H J H CC CC cC CO Z¥ 
Cxj Cu Iti S Cu Cm 





CO to CO CQ CO CO CO 
LLi Lij Cu Llj 



W Cm H H h U "hi 'h1 
S ^1 !> > H ^ CO W 





CO £h 

DS Jui h4 hi 

< Oe: Iju Cm >h 
823H 
W 

S co 

« h o o u co co co 

O S CD CO < < CD CD 



CO 

oi co co 

> Cm 1x4 

EES EES 
> > 

OToi Oi 





CO CO 





fO H VO 

cm m — 



rH rH; t-4 c— I f-H CVJ 

O O O o 

2;2;ZZ:000000 
SS % Z 2 2 Z 

Q Q Q Q 

Mh-itHhHGOOOOQ 

M JH M M hH M 

aooo 

tdtudwaaaaoa 

cocococoucauuiuu 

— ' — — — CO CO CO CO co CO 

cn o 

OHHHoonnoDr^w 
in * * • o M • • 

O ^ H H • * ♦ O 00 

oooorHi-tr-mcsim 
>;ooooOiHOrHO 

• a a ^ 
o - • ' a a a a a a 

OJ o o u 

g co iJ J3 CJ h o M *— I cn 

0) <D <D ^ T3 M 73 
<U > S 



CM 




VD 


CD 


o 


CM 




ID 


GO 


o 










rH 


rH 


rH 


rH 


rH 


OJ 


G 


o 


O 


O 














2; 


•z 






O 


O 


O 


o 


o 


O 










Z 


z 


Z 


2 


z 


z 


Q 


a 


a 


a 














fH 


M 


IH 


M 


a 


Q 


Q 


Q 


a 


o 










rH 


rH 


M 


H 




M 


O 


o o 


o 














Ex] 




w 


u 




CO 


to 


CO 


CO 




U) 


U} 






Cm 










CO 


CO 


CO 


CO 


CO 


CO 






o 


r- 














O 


I-t 




i-i 


GO 


n 




CO 




OJ 


• 


a 


•-H 




rH 


OJ 






si 


OJ 


in 


• 


« 


* 


O 






■ 


4 




o 




i-H 


rH 


m 


* 


■ 


o 


00 


■ 


o 


o 


o 


O 


rH 


rH 




m 


OJ 






o 


O 


O 


O 


O 


rH 


o 


rH 


o 








O 


O 


rH 


o 


O 


o 


* 


a a 


a 


M 




M 




u 


* 


• 


• 


a 


a a 


a a 


a 




o 


o 


o 
















iH 




Csl 


o 


o 


C 


rH 


rH 


o 




CO 


X) XI 


OJ 


rH 


O 


IH 




m 


i 


o 


x: 


x: 






rH 


U 


CO 








<u 








V4 






T3 












> 






• 





o o o o 

zzzzoooo 
z z z z 

o o o o 

MrHrHMOOOQ 
M rH M rH 

aooo 

u w aoao 

c% o 

* Qi*H T3 rH OJ OJ 

m • • * o ^ »h. ■ 

O O H H * « *0 

ooootHrHr^m 

-MOOOOOrHO 

* O* Cu Cu ^ ^ 

a - - * cu Cu cu Cu 

04000 • * 
CfJHOlM o a CH 
6 rojQjQojrHO u 

a) o 0 x: ^ ^ 

0J > 





<5I IR<?TITI ITP RMCCT /PI II C 0A\ 



WO 01/21650 



13/24 



PCT/US00/25856 




to to 



Lu Li 



CJ) CO 



O 



oo o 

rH CM 
• * ■ * 

o o 
2 2 

Q O 



O O 
CO C£) 

co co 



r- cm 
x: cm 

• 4-1 

00 • 

cm m 

rH O 

o o 
J* ^ 

* * 

r-H O 

i in 

CO M 

n 



cm h iJ ^EwES tJ tSCif^ 2 H 

ic/HEira cu cu cu u pil^CTtoil 



^ is? 



CO fr* ^ ttf 2 U 

HS> >>ai 

> M Ui Ul M <J 

co oJ u , . . . . 

^>>>>UjHHH^ 

hUhh>^UUh E-» 

earcai a o o ass 2 2 besbei 



O CO CO 

CO 0£ 

Q Q 
G33U3B 

S H H 

Uw > M 

2 (O d 



, , >* >* >* 

i i 

i 1 (2382312! 

W J H M S 

^ H M H . r . . . 

S^tOUtOr^tfoiC^U^ 



> > l f 
^ k I t 

mm i > 

H H W W 

> > cc cc 



> Cxi £ 



COOSSHCOi-^r^tO 

^ o: w w w * 2 z„ 2 
^ j ^ ™ 



2 
o 




^ ^ >* CJ if?3 co co _„ „ 

Q" O O O O CJ) J£ ^ Q Q 



> > > 

Ul ti3 ^ 



>)>!> |>. > 



til X X Di O H Kg g 



en t ^ ^ h n o h 



rH rH rH rH rH CM 

O O O o 

2222000000 

2 % 2 % 2 2 

Q Q Q Q 

mmi-hmOOQQQQ 

M M M W H W 

oao»a 

locococooaaaoo 

— — cocococococo 

r- <t\ o 

OriHHflororocor^N 

• • • O ^ -H * • *W 
O H H - * 1 O CO * 
OOOOHHMTKNin 
XOOOOOrHOrHO 

• CL CL O, M M M JX M M 

a • * - a O4 a a a cl 

CM U O O 

CO rH CM CM UO C H H O 

6 tn X) XI cm h o ^ H m 

OJ Q) 0) *C u M T3 

CM 

I 



O I J 1 I < I I I I I 

in 1 1 J 1 w 1 1 1 1 1 

CM 1 1 1 I H I I I I I 
lilt CD t I I • I 
C00222ao;aSaO 

m& 2 2 zmmmnm 

o >-*>>> w 

H M [d [J W H 

> > 2 2 2 r-1 

15 O W CO w o 
Q Q 2 2 2 Q 
^ ^ *J1 ^ ^ 

b^ii^ to co & m 
\ 1 0 155 

t I ^ X 

CO CO CO u u u 

CO C0 CO M 

CO CO CO CO W 2 

o ^ w [jj w S 

isisiSiSi&Ba 

^> t£i x x ad o 2 

OD(JCJPJQ[dti3DCl 
OOCOCOCO(J>COCOOO 



> 







200ri;cocucuoco 

h h h Rgli>J ^ 

2 ^ J J H (O CO H > 

2 X a ^ fa O? Q 



£3 

w < < w w 

S W S S U3 W 

> j o < < 



oi & to s^!$3fga o ssafsaco o 



H T •q* ^ CO H (T^ ^ 



CM^^OOOCVJ^^DOOO 

H H H H H (NJ 

OOOO 

2222000000 

2 2 2 2 2 2 

OOOO 

MMrHMOOOOOO 
M M M M IH M 

OOOO 

WU3WC0OOOOOO 
COCOCOCOWWtOCdtOCO 

^„„ wCOCOCOCOtn to 



O rH 

• ci- 
in * 
o cn 
o o 
M o 

a^; 
« a 
u • 

rg O 

CQ rH 

6 o 

0) 



o 

rH *H 



o o 

o o 

a a 

» * 

u o 

CM Csj 

x: x: 

CD <u 



oo m cn oa r- cm 

H (N OJ £ £ (N 

o J-i -h - * 4-1 

• * » O 00 • 

H ri if) CM if) 

O O rH O rH O 

O.O iH O O O 

M J* M M M M 

a a a a a a 

O O C rH rH O 
CN H O M rH m 
J3 J3 H M (f) ^ 

aj > s 



o 
o 
n 



H h h h f- h 

a: c: c£ cc cc 

f ! — ! 1 — ( HH »— 1 i-H 1 



> > 



^1 > 



0 


0 


0 




J Q 




a. 


a, 






b Icl. 





M M 

o s 

2 ^ 



>j>j_ X Ui Q 

I 

2 





in 
eg 









(H < 














CO CO 






m 






O 


0 


2 Q 


CO 


CO 


CO 


r-l CO 




1 f$q 


CO 


CO 


CO 


1 > 


CO 


CO 


CO 


1 0 


s 


2: 


H 


1 2 


2 


2 


^ 


1 2 


CO 


CO 


O 


1 < 






> 


1 > 


0 


0 




1 CO 


rH 


n 


M 


1 0 


> 


> 


> 


l < 






CO 


1 0 


CO 


CO 


2 


1 CO 


CO 


CO 


t 


t 


ID 




iD 


1 0 




Eh 


H 


J ^ 


< 




O 


1 *H 


2 


2 


2 


1 a 


< 




< 




CO 


CO 


CO 


1 1 


CO 


to 


^ 




^ 






1 oc 


CO 


to 


CO 


1 1 


0 


0 


O 


1 | 


1 


1 


CO 




00 


CO 




CM O 


GO 


00 




CO lO 


rH 


rH 


rH 


CM rH 











CM 



(N 00 O (M ^ 

rH rH rH 

OOOO 

2 2 2 2 0 0 0 

2 2 2 

o a o o 

rH rH rH M O O O 

M M M 

OOOO 

CO CO CO CO O O o 

CO CO CO CO CO CO CO 

— * — CO CO CO 

r* cn o 

U rH tH H CO CO CO 

* O4H "O H (M CM 

in • * * o *h 

O CjN iH tH * * * 

O O O O rH H 

M O O O O O rH 

q,>; ^ o o rH 

• Qj &• >i 

O ■ * - CL (X a 
CM O O U * 

« rH CM CM O O C 

£ f) U X) CvJ rl o 

T3 U X X i) XI h 

a> a> a) x tj v^i 

0) > 




WO 01/21650 PCT/US00/25856 



14/24 





< 
> 



h h 
a: c£ a: 



>■ a 
out-* 

U ti] to 
< h 
co U i 
S33:z i 
E IB3K3I 
a: u o* 
a; 

Q 

to < 

rH Ut 

.. 2 CO 
JES CO CO 
Q S Q 

CO H H 

> 
CD 

> 
CO 

Q 
*C 
Q 

to 

O 

a 

i 
i 

i 
i 
i 



o i ^^^oaoua; 

a;j*;cococoa;a*cuti:a: 
a: (BBtSBHtBl W3F« 




^^WMSCO^^ 

15: to ibiisheS3 to 



CO 




lO 

a 

pa 



J ^ 4 ^ 
[OSiEJJtEBES 




Q CO CO CO G 

*C Eh 
Q >h >h Uj CO 

co co co E5 



ST ST 

CO COl 
CO CO 




0< Oi 

Q O 

a a 




cauiwii-H m m tea n: t x 

z z z jgg tu bsjeeb 


















iHL 








m 


iD 






(J* 




CO fO 


CO- 




CO 




rr t 


co 




in 
m 



i i r 

i i i 

i i i 

>J t-q 

CO CO CO 

ix^ t£ w js 

x x n: a; 






*r 




Ch 




GO 


rH 


CM 


rH 














Co 


GO 


o 


i— C 


rH 


<M 


O 


o 


O 


z 






ID 


ID 


ID 


o 


o a 


CO 


[0 


to 


CO 


CO 


CO 








CO 




CM 


XZ 


.C CM 


* 


• 




o 


00 


• 


m 


Cs> 


in 


o 


rH 


o 


o 


O 


o 


J* 






a 




a 


* 


* 


* 

o 


n 








w 











(DlNiniOHOD^THOO 
(VJHCNNCNOJHOJCMCM 

rH f-t rH rH rH CM 

o o o o 

ZZZZOOOOOO 
Z Z Z Z Z Z 

o o o o 

MMMMQQQQQQ 
rH tH #H H tH *H 

a a o a 

co to cowaooaaa 
cococotowcocotocoio 

— CO CO CO CO CO CO 



r- cn 

* a 

o cr» 
o o 
o 

• a 

o 

CM 
CD 



-H T3 



O O 

O O 

a a 

* * 

o u 

eg cm 

•c x: 

Q) 0) 



oo rn m cd r- cm 

rH cm cm x: x; cm 

O -H • • MH 

• * « Q T > * 

H H Mfl CM tO 

O O *-4 O *H O 

O O rH O O O 

a a a a a a 



O O C «H 
WHOM 
Xfc i3 M 



rH lO 

(0 M 
3 



aocMCOcocno^cncMM? 

CMCMCMCMCMfOCMCMmCM 

fM^TVDOOOCM^T^DOOO 
iH r-l rH rH 1-* CM 

O O O o 

2:2;2;2;0000OO 

2 % 2 2 2 2 

Q Q Q O 

tH M M rH M fH 

O O O O 

toco w waaoooo 

cocococototococowco 
v — to CO CO CO CO CO 



h m o 

Or-frH 
(O 

o cn h 

o o o 

M o o 

o 

CM 



o 

H CM 

u x: 



rH 

13 



O 

o 
u 

CM 

x: 

0) 



<x> co ro ao cm 

*H CM CM .C JZ CM 

O -h * * m 

• * • O 00 ■ 

rH rH r- in cm 10 

© O rH O *H O 

O O rH O O O 

M M Jtf M M M 

a a a a Qi a 



u o c 

CM rH O 

XX X* rH 

x: t> in 

(U > 



u 
u 



th a 

r^ tO 



I 




CMQCTITIITC: QUPCT /Dill C OC\ 



WO 01/21650 



PCT/US00/25856 



15/24 



o 

IT) 



Ixt Cu Uj 
U H > 

s o 

O CO 

u a 
w 

s 



co 

> 

OS 

< 
> 

a 

a 



CO 

cu 

2 



In 
»J 
CD 
oC 
X 

n: 

E-| 

CO 

►J 
CO 

Cu 
Cu 

>< 

(J 



On 

2 
I 
l 



f ^ 








1 J 






*^ 




PH 


C ■ 




CO 


CO 


#^ 




r* 




1 D 


> 


*J 


» 




i l 




1 H 


M 






1*4 




v; 














T~* 




CO 










Cm 


1 H 




1 


j 




■ 


al 


1 Id 


E-" 


1 


J 


j 


■ 




I co 


o 


f 


J 


1 


i 


> 


1 Q 


CO 






1 


i 


CO 


1 > 




1 




1 


| 


rH 


1 CO 




1 


j 


j 




cu 


1 cu 


X 


1 








cu 


1 J 


u* 


1 






| 


U4 


1 >• 




1 


1 






to 


1 CO 


a: 


1 


j 


1 




lu 


i 2 




1 


! 


1 






i J 


o 


1 




1 


I 




j CO 




1 


J 


j 


j 


1—1 


1 o 




1 


I 


j 






f J 


a; 


1 


! 


1 




-* 


* * * 




1 


j 


1 




co 




Pi 


I 


! 


1 


I 




■ ■ 


>* 


1 


[ 


[ 




* ■ ■ 




3d 


1 


* 


1 


» 


H 




>h 


j 








M 


| ] 












to 




a: 


[ 


] 


1 


j 


CO 


] | 


CO 










DC 




>* 














>* 










to 




u 














CO 










Cu 




a* 










E-» 




CO 










3J 




Cu 










OS 




Cu 










*c 




























a 




> 










< 














CO 




a* 














w 














cu 










Cu 




*i 










a; 




cc 














^ 








t 


CO 




co 










<J 




Cm 










2 




S 




rH 


rH 


1-t 











C4 C\| CN) ■* •* *• *• ^ tH CJ 

oooo *• •* 

OOO^^SSOOOOOO 
S Z % 

Q Q O O 
QOQMHiMMQQQOQQ 
H H H MMMMMH 

oaacjwwwaoaooo 

CO CO-CO — CO CO CO CO CO CO 



r m o h 

H OJ (jHHHOOnrOODr^CM 
4JM • Qi^ t H (N N ^ £ CM 

v-i <u in • * *o^-h * • 

(0 M O (Ti H rl ■ • *O00 * 
01 w^OOOOOtHOiHO 

nj as a*^ J^-^ooiHOOO 

^*>4>ho * * * a (X (X & 

eg O O O - • j " • ! 

Wi-H<MCMOUCiHrHU 

E co.Q.QcNjrHO M H m 



CO 

■p 

3 




o 

2: 



Cu u ^ 

o 2; w 

> n i 

os w 




< 





















CO 




















H 




CO 




n 

a; 










CO 


Pd 


w 






CO 






1 


IF 


SL 




















> 


Eh 






a: 






CU 


a 




0 




Ui 


»J 


H 




M 




g 


1 


CO 






1 




CO 


CO 


I 




rc 


0 


I 




CD 


z 


rH 


CD 








CM 








CO 




eg 




CM 


CM 










0 



a» eu cu j 

CP O CD CJ 

H H H ffi, 

M M n o 

CO CO CO Q 



« ^: 

pc a; 

s s 
>• 

CD U 

^ »4 

OS CC 

Cu Cu 



Cu 
CJ 

CD 
« Id 
OS z 
s > 
>■ u 
be: a: 

cc co 

Cu CD 



to co co 

Cu Cu CU Q 

OS 05 OS H 

w W a i£ 



I 
I 
I 
I 
I 
I 
1 
I 
I 
I 
I 
! 
t 

> 

g 

OS 

n 

Cu 
2 

a 

CO 

5 



cu 
cu 

CO 
CO 

> 
a 

Q 

> 
CO 

o 



rij co 

CO CO 

a 

VC CO 



O 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
t 

CO 

•J 



cu 

> 

Eh 

% 

cu 



CsJ ^ ^ 00 O CM 



^ © O 
H H H W 



00022Z2:000000 

%2S222 

O O Q O 
OOQMHI-hmOOOOQO 

H H H HMtHMMfH 

0000 
a& ctu u* u u ctac* cicx a 

«coco — — w — cocococococo 



CO rH <N 

CU 4-» U 

U <D 
co 
U 4-1 

^ 0] o 

H (0 (0 

. a) a; 

s >^ >« 



<o a> d> ^ t3 



T3 



O rH 

* a 

in * 
o a> 
o o 
o 

• a 
o 

CM 
CO rH 

-a o 



u 



H H CD 
•H Tj H 
* • O 

rH iH • 
O O rH 
OOO 
^ M O 

o o * 

CM CN O 
XI <N 

*c x; xi 
0 



CM CM 

• * 

rH 

O rH 
O rH 

M 

cu a 



00 

jC sz 



CM 
CM 



o CO 

m cm m 

O rH O 
OOO 
^ ^ 

a & a 



o 

rH 

XX 
T5 
> 



o 



u 
u 



O 

m 

M 

n 

3 



CM 
* * 

o 

2 



O 
U3 

CO 



CO 
Pu 

cu 
a 




c*i inCTiTi iTr oi.irrT ini 11 rr 



WO 01/21650 



PCT/US00/25856 



16/24 



OjCOOiCU^XOCO Q Q 

lira 2 s teas tfc*i Hid s flEnraa ~ ~- 






CO f 



oo to co to co <a oj co co 

£aj [±j Li_i Cu"- Ll. [tj tlj Cij ft_r 






£> > M M M M > 

jo IKpijJ gj S^Hg4igU co 



. Jkb « il—™™,-™ 

j & tu ^ J ^ S 2 2 . 

>h [i] u h ^ h b3 i-3 a: PS 

E-o;>cozzcGWri:<£t£ 

H w w H ri, <T< co <C < co 
e> < etf Ui > Cju Cxj > 

td W E <o x a OS c£ O td 

in taenia lararaiararaBpiei 

u u'u 1P1P § 0) co to IPfP 

O) ^ O} U ^ ^ ^ ^ Cxi Cxj > 

""liaBIEir3E3H©BlSlEjlEaiail 



a: i en 





£h E hh J> ^ > > m 

n: td tonaiuuxns a, a. a. 
Q Cb X X!^ ^ u w 

CU CO O Oj CU CO CO Q 
Q ^ M l2 Z J iJ J 

UiOQOitfZZZ 

M>MM2>>> 
> M Cm Ui 



td 



co co o; 
> > 



o > > 2 

a; a: ^ o 
a > > *h > 

Cd raHEOTIKEEMl 
CO X i£ Cd M 

§co co cu cu 

mm Q Q CO CO 
O ^ CO Ofi 

^ z z a o 
g 2 h h 

> M 
CO OS 

. ^ O L> M H 
©82 Z ISUfcE 



UI Id 

> > a: a; 

Z Z ^ 

a: as tw 

Z Z 8h W 




IT) Q U ft > W 



O 

m 



i r i 



i 

CM td CO CO CO O Z 

□ ^ UJ UJ ^ s 

a 

M O M > > > 

H H U] U W 

< > > z z z 

O O U W CO CO 



■J 

CO 



td 

o 



I 

I 

I 

1 

z 



I M 

i e> 
z z 
z z 
a Qj u U o 
> 



co cj x a q 



z 



z 



2^ i-J ^ £*i ^£ 

>h a- ac > > 

OS Q o 

td td > ^ £h 

< > m < *c 

H CO H H > 



z 

U3 Cd ^ 

i i a 

1 I X 

CO CO CO 

CO CO CO 



CO 
M 



Eh 

s 

ETz til 



a a 

!>-* S>h 

a: a; a: ^ M 
En co cestje&isn co co co 

LDEhEhCOCOEhEhCO 




(J U H ^ > > > 

[uHH>^HHH 

> u* z > Z i-l ^ J 

EhEhEhZX>>> 
,JLyirf»JEHUUU* 



x a z 
m a ^ 

(SJfEBtEfl!£31 co 

S m h m i< 

^ W > W W 2 
*t > *t: *C 

OiJMDiEHMClQtJ 
(N>h>hO<<< 




r^oH\ra)CM(\jcM 



<N1 CM H H H rl rl (M 

" O O O O "XX X X 

oozzzzoooooo 
zz zzzzzz 
a o a a 

OQmhhmi-hOOOOQO 
MM MMMMMM 

o o o o 

OOtJU tJ tJOOOOOO 
WWCOCOCOCOCJpJCJpJCiJtd 
CO co ^ — CO CO CO CO CO CO 

o% o 

M CM OHHH(DfnfOOO^CM 

a; m - - . o ^ -h * • 

tO U O H H * • ♦ O CO * 

4J4JOOOOMMr-mcMin 

CO CO^JOOOOOfHOfHO 

>^ >h u * * ^ a a (X & Q« 

cm o o o 

g w X) N H o ^ H m 

-9 o <c x: si & <-* m to^ 
a> <u <i> -c "O n tj 
<d > s 



^1* QD VO 

CM CM CM 

6 6 O 

Z Z Z 

d a o 

M M M 

o a a 

to ui w 

CO CO CO 



M iH M M M CM 

o O O O - 

zzzzoooooo 
zzzzzz 

o a o o 

mmmmOOOOOO 

MMMMMM 

o o> a o 

COCOCOCOCdWCJCJUlCJ 
— ^ — — — 'COCOCOCOCOCO 



o% o r- 



CO H CM 

M 0) 

en co 

>* >* 



04 

3 



O M M 

in + • 
o a\ m 
o o o 
o o 
a-* * 
• a* ex 

o • * 
CM U O 
W M CM 

O JZ 



m oo co r> oo r* cm 

T3 M CM CM JZ JC Csl 

- O -H * * *w 

M • * • O 03 • 

o M m in cm m 

O O O fH O M o 

^ O O M O O O 

QtJ* J* M M J* M 

* Cu Q* Cu Q4 CX O- 



u 

CM 
-Q 

0) 



O O C M M O 

CM M O ^ H if) 

.O ,Q h M <rt 

JZ 13 U T> 

Q) > S 



CM CM CM rH 

o o o o - 

ooozzzzo 
z z z z 

O Q Q O 
OOOmmmmO 

M M M M 

0000 

U1WWCOCOCOCOUJ 
CO CO CO ' — ' ' — ' — ' — ' CO 



CO 

Cu 



u 

CO 

to 

0) 

>H 



CM 

Q) in 

4J 

to 
>• 



O M 

a 



O (TV M 

000 
o o 

* a. cx 

o 

CM 
CO 



iH OO 

T5 M 

- o 

M * 

O M 

o o 

M o 

a 

• a 



o 

iH CM 

en Xi 

U £ 

a a) 



o 

CM 

JC 

0) 



o 

CM 

Xfc 

x: 

0) 



CM 
I 




55IIR55T1TUTE 55HFFT mill F 



WO 01/21650 



17/24 



PCT/USOO/25856 




CQ Out £3li 

hi s s 

> > > 
H CO to 
tu Uj 

o > > 
j J j 

tC C3 CD 
WWW 
F> 6- H 

u s 

> J 

Q iC 
n W W 
v-\ (£, <£. 



hi 



h oo co 

00 H irt <J^ (O 




CO 

IE**! 

o 
mi 

< 

t 

i 

5©] 

Cm 

to 

DC 
CO 

to 

tea 



co 

o 

to 

I 

o 
s 



M 



hi „ 

a i i i u u u w ^ 
Hzzze^rfitf^co 

2HH>OtOCOO i 

^^w^^iHZ^^zi 
^ st: hi os oi 

^ ^ J 

Q CO Q 
< M 
hi el > 
Z 



CD CLi 

< l-H 
E-* U 

z 



CO hi J 

hi Z O . - 

£-i CO S<S Q Q 

M Cm 

Eh IH 



CM 



a: 



> > 

S3 ^ 



a 

CO CO CO 



hi hi 

a 
co < 

H Cu 
IH hi 
Z CO 



tOCOrHMHirfHrHCOtO 



Q O O O Z Q a 
lESRco co co FEU cn to 



CO CO CO 

CO CO CO 

s s 

z z 

[j w a 

H H > 

O 0> DU 



> 
CO 

z 



> > 

H H 

to CO 

co to 

o o o 

H S 

z 



z 



z 



CO CO 
CO 



Cx* 



hi 

CO CO CO 

a o a 
I t co 
i i < 

I I CO 



> > 

(3 iD 
Z Z 



E33 



z 

> > 

CO CO 



z 



a 



a 

Q O 
CO CO 

ex; 

o a 
hi hi 



Q 
1 
t 
I 

a: as 



a 



r^csjoDHiOooooo^fOific^^a) 

HNHOJHHHHCMHHCNH 



O Ci H W 

m n: to ^ 

ro c£ u< re 

Z k h 

roe co to 

>h s r 

H X 

CO hi 




os^cotococGf^Putfips 
:r: rr; o£ > > o£ o£ 

Z Z co O O to rf? 
w 2 co hi hi to S 

> > J H H J S 

h w H gC3S3^JgtMJ 

52 

teass3B2B^a toco (tD^ 

HHH fplg«^P& 

OOCOtOCOOCOCOa 





10 H CJ 

W CO hi 

> td CO 

U, H H 

CO Eh 

I < 




>« J _,^_, .„.. . 

CO IGQ>-« >* Ui co p£ 503 

COEHCuaiCuCuOiOiZEH 

tsaiea co co to ssi z z usatp 




CO CO hi HJh H CO hi H H hi T3" 

> ^mauBHOii hi icaissifHniizraicai 




o 



H CM CM W H 





e; 






CO 


CO 


CO 


CO 


















o 


en 






00 


m 


n 


m 


rH 


CM 


CM 


CM 




m vd o 

Osl CM CM 



CM 






CO 


o 


rH 




rH 


tH 


CM 


o 


o 


O 


O 


o 


z 


z 


z 


z 


Z 


Q 


a 


o 


a 


a 


H 


\—{ 


M 




VH 




o 


o 


O o 


w 


CO 


CO 


CO 


CO 


CO 


CO 


to 


CO 


CO 












m 


CO 


CD 




CM 


CM 


CM 


x: 




CM 






» 


* 




• 


* 


o 


oo 


• 


tH 




to 


CM 


m 


o 


tH 


o 


tH 


o 


o 


r-i 


o 


o 


o 












a 


a 


a 


a a 


o 


c 


rH 


tH 


o 


tH 


o 


U 


rH 


m 






u 


cn 












-a 



o 
z 



00 cm 

CM CM CM " " 
O O 

o o o z z 

Z Z Z _ _ ^ 

O Q O 

Q O O M rH M 

HHH _ 

o o o 

o O O CO CO CO 
CO CO to CO CO CO 
CO CO CO " ^ 



r- o> o 



CO 



CQ 
CO 



CM 

a) 

M 
(0 

cd 



CL-H 



in 
o 
o 



<J\ rH 
O O 

O o 
CUM X 
* cu Cu 
u * * 
CM U O 

C/l H N 

e co xi 
-Box: 
<u a> 



OT O 


CM 




\o 


GO 


o 




rH 


rH 


rH 


*H 


CM 


z o 


O 


o 


o 


O 


O 


z 


z 


z 


z 


z 


z 


a 

H Q 


a 


Q 


o 


o 


a 


H 


rH 


rH 


rH 


w 


M 


woo 


o o o o 


CO CO 


CO 


to 


to 


to 


CO 


— ' CO 


CO 


CO 


CO 


CO 


to 














r- 












rH CX> 


n 


ro 


CD 




CM 


XJ rH CM 


CM 






CM 


• o 






• 






rH • 




• 


o 


rjo 


* 


O rH 


rH 


r- 


in 


CM 




O O 


o 


rH 


o 


rH 


o 


J* o 


o 


rH 


O 


o 


o 




M 




M 




M 


* a 


a 


0* CL 


Cu 












u 


CM O 


o 


C 


rH 


rH 


XI tN 


rH 


o 




rH 


in 


X XI 


€ 


rH 




CO 


M 


<d x: 




1H 






TJ 




> 








5 



^ 00 CM 
CM CM CM 
O 

o o o z 
z z z 

a 

q a q h 

HHH 

a 

O* C O co 
to CO CO CO 
to CO co — 



^ K£> 03 O CM 



o o o 
z z z 

o a q 



o o o 

CO CO CO 
CO CO CO 



^ u) m o 

H H rH CM 



o o o o o o 

z z z z z z 

Q O O Q Q O 

H H H rH H H 

oaoaoo 

to CO CO co to CO 

CO CO co to CO CO 



rH CM 

u u 

u o 

to u 

+J +J 

co to 

s >■ >■ 



to 

rH 



U rH 

* a 

m « 

O CTk 

o o 

M O 
CL 

• a 

o * 

CM O 
CO H 

e to 
(1) 



o r- 

rH H 

■H 13 



oo m m co cm 

H CM CM ^ Jd CM 

* • O *H * • *W 
H H - * • O OO 

ooHrHr—mcMtn 

OOOOHOHO 
Jki^OOrHOOO 

* * cx cx cu ex a, 
u u 

CMCMOOdrHrHO 

XI cm h o M h m 
<1J > 5 




OI IOCTITI ITC CUCCT /OI II 



WO 01/21650 



PCT/US00/25856 



18/24 



CP* 



1 CO 


I 


I Q 


i 


1 O 


i 


l H 


i 


l > 


l 


1 CO 




1 > 


i 


I CO 


i 


\ HH 

1 Oi 


i 


1 Cu 


i 




i 


1 cu 


l 


\ Ui 


I 


J % 


i 


1 >* 


i 






1 CO 


i 






i 3 








1 b£ 




1 ^ 


CO 


t 1£ 




I ^ 




1 


Q 


1 M 


M 


i co 


CU 


1 


Q 


r > 


O 


1 CO 


2: 


f w 




1 X 


CO 


1 M 


10 


1 


a 


1 CO 




I Uj 


a 


I J 


o 


1 CO 


CO 


\ ^ 


£0 








o: 


1 W 






*0 


CD >h 




C5 & 


2 


Cju O ^ 


a: *h 


C» 


in <r> 





o 



•J J 

re x 



o 

J I < 
>* I ^ 

^ z oi 
£ (/) 
^ (J O 
W O Id 
M 
OS CO 



g 

a: 
to 

a o a: 
woo 

h > [u 

cn a: a: 



k3 

s 

CO % 

a; a; 



CMCMCMCMCMCMCMcgcncgegncg 





00 




eg 




GO 


eg 


CM 


CM 












O 


O O 


o 


O 


O 


o 


z; 




z: 






2: 
















o a 


a 


a 


a 


o 


IH 




M 




IH 


M 


o 


o a 


a 


o 


a a 


U3 


CO to 


w 


W 


w 


w 


CO 


CO CO 


CO 


CO 


to 


CO 




























cn o 


r- 


CO 


tH 


CM 


a 


iH rH 


iH 


Oi 


JJ 


M 


• 






Cu 


U 


a> 


IT) 


* ■ 


« 




CO 




o 




iH 




±J 




o 


O O 


O 


D 


to 


cn 


J* 


o o 


O 


rH 




id 


a 




M 






0) 


■ 




a 








0 


• * 


• 








CM 


u o 


o 








(0 


*h eg 


CM 










to JD 










i 


o x: 


x: 















o oj od o 

H H H H H CN| 

OOOOod 
% Z 2 2 21 2 

a a a a a a 

M HH M W IH Hi 
OOOOOO 

to til w w w w 

CO CO CO CO CO CO 



r-t <N 

o ^ 



o o 
o o 

a a 



cn cd r- cm 

eg x: x: cm 

-H • * *M 

• o 00 * 

r^- in (N in 

iH O rH O 

tH O O O 

^ ^ ^; 

a a a a 



o 

CM 

u 



o 

tH 

Si 

> 



o 



ih a 
h m 

to M 
T3 




SUBSTITUTE SHEET (RULE 26) 



WO 01/21650 



PCT/US00/25856 



19/24 



Wild Type 




TIME (min) 



FIG. 5A 



35S::Hpt3 

1000 -i 
800 ■ 




0 5 10 15 20 



TIME (min) 

i 

FIG. 5B 



SUBSTITUTE SHEET (RULE 261 



01/21650 



20/24 



PCT/US00/25856 



35S: :rr1 




-100 -I 1 1 1 1 

0 5 10 15 20 

TIME (min) 



FIG. 5C 

35S::Apt5 




-100 -i 1 1 1 ■ 

0 5 10 15 20 

TIME (min) 



FIG. 5D 

SUBSTITUTE SHEET (RULE 26) 



WO 01/21650 



PCT/US00/25856 



21/24 



35S::sl1 



DAD 
S i gno l 




o 



5 10 15 

TIME (min) 



FIG. 5E 



SUBSTITUTE SHEET (RULE 261 



01/21650 



PCT/US00/25856 



22/24 



15000 



Wild Type 



MSD 

Signal 



10000 



5000 ■ 



0 



iiAli^nqd 



0 



5 10 15 

TIME (min) 



20 



FIG. 6A 



1 5000 i 



MSD 

Signal 



1 0000 • 



5000 



0 



35S::Hpt3 



fit lUKI (rtlr^ 



0 



5 10 15 

TIME (min) 



20 



FIG. 6B 



SUBSTITUTE SHEET (RULE 26) 



WO 01/21650 



PCT/US00/25856 



23/24 




FIG. 6C 



1000 i 



35S::Ap+5 



MSD 

Signal 



750 



500 - 



250 " 



0 




0 



5 10 15 20 

TIME (mln) 



FIG. 6D 



SUBSTITUTE SHEET (RULE 26) 



01/21650 PCT/US00/25856 

24/24 



35S::sl1 




TIME (min) 

FIG. 6E 



SUBSTITUTE SHEET (RULE 26) 



WO 01/21650 



PCT/US00/25856 



SEQUENCE LISTING 

<110> E.I. du Pont de Nemours and Company 

<120> cis-Prenyltransf erases from Plants 

<130> BC1019 PCT 

<140> 
<141> 

<150> 60/155,046 

<151> 1999-09-21 

<160> 37 

<170> Microsoft Office 97 

<210> 1 
<211> 1388 
<212> DNA 

<213> Dimorphotheca 
<400> 1 

ggcacgacag gtttcccgac tggaaagcgg gcagtgagcg caacgcaatt aatgtgagtt 60 

agctcactca ttaggcaccg caggctttac actttatgct tccggctcgt atgttgtgtg 120 

gaattgtgag cggataacaa tttcacacag gaaacagcta tgaccatgat tacgccaagc 180 

gcgcaattaa ccctcactaa agggaacaaa aggctggagc tccaccgcgg tggcggccgc 240 

tctagaacta gtggatcccc cgggctgcag gaattcggca cgagcttaaa taatgcttaa 300 

tcttcccctc tacttaccca aatatccttg ttatttcccg gcctctctct ccaccaacca 360 

ccaccgtggt ctttatgtat tcaaccaatc agacaccact ggaggtggaa ttaattcgct 4 20 

ggaggaacgc attactccag caggactcaa gcacgagtta atgccaaagc atgtggcagt 480 

gatcatggat ggaaacagga gatgggctcg atcacgtggg ttaatgccgg atgctggtta 54 0 

catggaaggt gcacgctcat tgaaggtgat ggtggaattg tgtcgtaaat ggggaattca 600 

agtccttact gtgtttgcct tctcagctga taactggtta agacccaaag ttgaagttga 660 

tttcttgatg ggactaattg aaagtgtatt aaaagatgaa gttgttcata tgatcaaaga 720 

gggtatccag ctttcggtta tcggagacac atctaagctt ccaaaatcgg taaaacggat 780 

cattacatat gctgaaaaca tcacgaagaa caactcacaa ctcaatcttg ttgtagcaat 840 

aaattatagt ggaaaatatg atatcgtcca agcttgtcaa agcatcgcac taaaagtcaa 900 

agacggtgtc attcaacccg aagaaatcaa tgagtttacg attgaaaatg aacttggtac 960 

aaattgtatt ccttttccac accctgatct actaattcgg actagtgggg agcttagagt 1020 

gagcaacttc tttttgtggc aattggcgta tactgaatta tacttcagtg aaactctttg 1080 

gcctgatttt ggtgaagatg aacttttaca tgctttaaat acttttcaac atagacgaag 1140 

acgttatggt ggatgagatt cttaaacaac cctgtagagt tgcatatcat attgactttt 1200 

gatatgtttc aatactattt atattattat tatgttgtaa tatcgtacta gaacatgaat 1260 

ttaaataggc aatagagcat gccacctaat atgtctagtt atgagattct aaagacgtaa 1320 

ttatgcttac ctaaaagaaa atatatatga agagaaaagc ttatgtaaaa aaaaaaaaaa 1380 

aaaaaaaa 1388 



<210> 2 
<211> 287 
<212> PRT 

<213> Dimorphotheca 
<400> 2 

Met Leu Asn Leu Pro Leu Tyr Leu Pro Lys Tyr Pro Cys Tyr Phe Pro 
15 10 15 

Ala Ser Leu Ser Thr Asn His His Arg Gly Leu Tyr Val Phe Asn Gin 

20 25 30 

Ser Asp Thr Thr Gly Gly Gly lie Asn Ser Leu Glu Glu Arg lie Thr 
35 40 45 
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Pro Ala Gly Leu 
50 

Met Asp Gly Asn 
65 

Ala Gly Tyr Met 



Cys Arg Lys Trp 

100 

Asp Asn Trp Leu 
115 

He Glu Ser Val 
130 

He Gin Leu Ser 
145 

Lys Arg He He 



Leu Asn Leu Val 

180 

Gin Ala Cys Gin 
195 

Pro Glu Glu He 
210 

Cys He Pro Phe 
225 

Leu Arg Val Ser 



Tyr Phe Ser Glu 

260 

His Ala Leu Asn 
275 



Lys His Glu Leu 

55 

Arg Arg Trp Ala 
70 

Glu Gly Ala Arg 
85 

Gly lie Gin Val 



Arg Pro Lys Val 

120 

Leu Lys Asp Glu 
135 

Val He Gly Asp 
150 

Thr Tyr Ala Glu 
165 

Val Ala He Asn 



Ser lie Ala Leu 

200 

Asn Glu Phe Thr 
215 

Pro His Pro Asp 
230 

Asn Phe Phe Leu 
245 

Thr Leu Trp Pro 



Thr Phe Gin His 

280 



Met Pro Lys His 

60 

Arg Ser Arg Gly 

75 

Ser Leu Lys Val 
90 

Leu Thr Val Phe 
105 

Glu Val Asp Phe- 



Val Val His Met 

140 

Thr Ser Lys Leu 
155 

Asn He Thr Lys 
170 

Tyr Ser Gly Lys 
185 

Lys Val Lys Asp 



lie Glu Asn Glu 

220 

Leu Leu He Arg 
235 

Trp Gin Leu Ala 
250 

Asp Phe Gly Glu 
265 

Arg Arg Arg Arg 



Val Ala Val He 



Leu Met Pro Asp 

80 

Met Val Glu Leu 
95 

Ala Phe Ser Ala 
110 



Leu Met Gly Leu 
125 



He Lys Glu Gly 



Pro Lys Ser Val 

160 

Asn Asn Ser Gin 
175 

Tyr Asp He Val 
190 



Gly Val He Gin 
205 



Leu Gly Thr Asn 



Thr Ser Gly Glu 

240 

Tyr Thr Glu Leu 
255 

Asp Glu Leu Leu 
270 

Tyr Gly Gly 
285 



<210> 
<211> 
<212> 
<213> 



3 

1082 
DNA 

Calendula 



officinalis 



<400> 3 

atgacattat 

ccaactctaa 

ttacaaatat 

gaagtagaat 

atggatggaa 

gccatgagaa 

gtatcgattt 

ctaatggaga 

tgtcgagtaa 

atcgaaatag 

tacagtggaa 



tttccctaat 
tttcttcaac 
atcaacggtt 
taccaggggg 
accgtcgatg 
agacgcttca 
atgcattttc 
tgtatgaaga 
gcataatggg 
aagaaaaatc 
aatacgacat 



tactcaatta 
cgcgtgtcac 
ttgagcaatg 
tctcgaagaa 
ggcggtggaa 
atctctcctt 
taccgaaaat 
tttattgagg 
gaaaaagacc 
aagagccaat 
aatcgaagct 



aaccttgttt 
caataacttc 
aaaataccaa 
gaactaatgc 
aaaggttggt 
tttcgatgtt 
tggactcgcc 
acagatgctg 
aaccttccga 
tcaggaaccc 
tgtaaaagcg 



agctcctaaa 
ggggataatt 
actgaaaacc 
caaaacacgt 
ctccaatgac 
ccaaattcaa 
cgaaggaaga 
aggagctctt 
aatcactaca 
atgttaacta 
tcgctacaaa 



ccacactctt 60 

cgttcatcga 120 

aaaaaaagaa 180 

tgcattcata 240 

gggtcatagt 300 

aatcaaagcg 360 

agttgatttc 420 

aagtcttggt 480 

aaagttatgc 540 

tgcactcaac 600 

agtcaaggat 660 
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ggtgttatta ttccaaaaca gatcgacgaa aaatatttca aacaagaact cggtaccaaa 720 

atgatcgatt ttccttaccc tgacctagtt atacgtacaa gcggggaaat taggcttagt 780 

aatttcatgc tatggcagat ggcgtatagc gagctttatt tcacggataa atactttccg 84 0 

gattttgggg aaaatgatct tatcgaggct ttacttgcat ttcaaaaagt gcgtaaatgt 900 

taataacttg ttgtggttaa gacgagtgtg gtagaatatc aataaatgac tcgtttcggc 960 

ggcgttgtgt atgccacatt atatgtctta gtgtctatca gaattcgaat ttgatttata 1020 

gtcgcttgag atatgaaaac ttattatatt tgttcgatca aaaaaaaaaa aaaaaaaaaa 1080 

aa 1082 



<210> 4 
<211> 228 
<212> PRT 
<213> Calendula 

<400> 4 

Met Pro Lys His 
1 

Val Glu Lys Gly 

20 

Thr Leu Gin Ser 
35 

Val Ser lie Tyr 
50 

Glu Val Asp Phe 
65 

Ala Glu Glu Leu 



Lys Thr Asn Leu 

100 

Glu Lys Ser Arg 
115 

Tyr Ser Gly Lys 
130 

Lys Val Lys Asp 
145 

Phe Lys Gin Glu 



Leu Val lie Arg 

180 

Trp Gin Met Ala 
195 

Asp Phe Gly Glu 
210 

Val Arg Lys Cys 
225 

<210> 5 
<211> 1071 



officinalis 



Val Ala Phe lie 
5 

Trp Ser Pro Met 



Leu Leu Phe Arg 

40 

Ala Phe Ser Thr 
55 

Leu Met Glu Met 
70 

Leu Ser Leu Gly 
85 

Pro Lys Ser Leu 



Ala Asn Ser Gly 

120 

Tyr Asp lie lie 
135 

Gly Val lie He 
150 

Leu Gly Thr Lys 
165 

Thr Ser Gly Glu 



Tyr Ser Glu Leu 

200 

Asn Asp Leu He 
215 



Met Asp Gly Asn 
10 

Thr Gly His Ser 
25 

Cys Ser Lys Phe 



Glu Asn Trp Thr 

60 



Tyr Glu Asp Leu 
75 

Cys Arg Val Ser 
90 

Gin Lys Leu Cys 
105 

Thr His Val Asn 



Glu' Ala Cys Lys 

140 

Pro Lys Gin He 
155 

Met lie Asp Phe 
170 



He Arg Leu Ser 
185 

Tyr Phe Thr Asp 



Glu Ala Leu Leu 

220 



Arg Arg Trp Ala 
15 

Ala Met Arg Lys 
30 

Lys He Lys Ala 
45 

Arg Pro Lys Glu 



Leu Arg Thr Asp 

80 

He Met Gly Lys 
95 

He Glu He Glu 
110 

Tyr Ala Leu Asn 
125 

Ser Val Ala Thr 



Asp Glu Lys Tyr 

160 

Pro Tyr Pro Asp 
175 

Asn Phe Met Leu 
190 

Lys Tyr Phe Pro 
205 

Ala Phe Gin Lys 
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<212> DNA 

t 

<213> Hevea brasiliensis 
<400> 5 

tctcattcga gtgctcaagt tgcaaaccac ttttgatttt ggaggattta ccgagtcacc 60 

tacaggcttc gggttaaagc atcgtgatgt gggtttaagg aaatggaatt atataccagt 120 

taagtcagtg atttaaggaa aatggaatta tacaacggtg agaggccaag tgtgttcaga 180 

cttttaggga agtatatgag aaaagggtta tatagcatcc taacccaggg tcccatccct 240 

actcatattg ccttcatatt ggatggaaac aggaggtttg ctaagaagca taaactgcca 300 

gaaggaggtg gtcataaggc tggattttta gctcttctga acgtactaac ttattgctat 360 

gagttaggag tgaaatatgc gactatctat gcctttagca tcgataattt tcgaaggaaa 420 

cctcatgagg ttcagtacgt aatggatcta atgctggaga agattgaagg gatgatcatg 480 

gaagaaagta tcatcaatgc atatgatatt tgcgtacgtt ttgtgggtaa cctgaagctt 540 

ttaagtgagc ccgtcaagac cgcagcagat aagattatga gggctactgc caacaattcc 600 

aaatgtgtgc ttctcattgc tgtatgctat acttcaactg atgagatcgt gcatgctgtt 660 

gaagaatcct ctgaattgaa ctccaatgaa gtttgtaaca atcaagaatt ggaggaggca 720 

aatgcaactg gaagcagtac tgtgattcaa actgagaaca tggagtcgta ttctggaata 780 

aaacttgtag accttgagaa aaacacctac ataaatcctt atcctgatgt tctgattcga 840 

acttctgggg agacccgtct gagcaactac ttactttggc agactactaa ttgcatactg 900 

tattctcctt atgcactgtg gccagagatt ggtcttcgac acgtggtgtg gtcagtaatt 960 

aacttccaac gtcattattc ttacttggag aaacataagg aatacttaaa ataatttggt 1020 

tctgttccta gctcatcctg ccttattccg ataggttaag cttaagcata t 1071 

<210> 6 
<211> 290 
<212> PRT 

<213> Hevea brasiliensis 
<400> 6 

Met Glu Leu Tyr Asn Gly Glu Arg Pro Ser Val Phe Arg Leu Leu Gly 
15 10 15 

Lys Tyr Met Arg Lys Gly Leu Tyr Ser lie Leu Thr Gin Gly Pro lie 

20 25 30 

Pro Thr His lie Ala Phe lie Leu Asp Gly Asn Arg Arg Phe Ala Lys 
35 40 45 

Lys His Lys Leu Pro Glu Gly Gly Gly His Lys Ala Gly Phe Leu Ala 
50 55 60 

Leu Leu Asn Val Leu Thr Tyr Cys Tyr Glu Leu Gly Val Lys Tyr Ala 
65 70 75 80 

Thr He Tyr Ala Phe Ser He Asp Asn Phe Arg Arg Lys Pro His Glu 

85 90 95 

Val Gin Tyr Val Met Asp Leu Met Leu Glu Lys He Glu Gly Met He 

100 105 110 

Met Glu Glu Ser He He Asn Ala Tyr Asp He Cys Val Arg Phe Val 
115 120 125 

Gly Asn Leu Lys Leu Leu Ser Glu Pro Val Lys Thr Ala Ala Asp Lys 
130 135 140 

He Met Arg Ala Thr Ala Asn Asn Ser Lys Cys Val Leu Leu He Ala 
145 150 155 160 

Val Cys Tyr Thr Ser Thr Asp Glu He Val His Ala Val Glu Glu Ser 

165 170 175 

Ser Glu Leu Asn Ser Asn Glu Val Cys Asn Asn Gin Glu Leu Glu Glu 

180 185 190 
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Ala Asn Ala Thr Gly Ser Ser Thr Val He Gin Thr Glu Asn Met Glu 
195 200 205 

Ser Tyr Ser Gly He Lys Leu Val Asp Leu Glu Lys Asn Thr Tyr He 
210 215 220 

Asn Pro Tyr Pro Asp Val Leu He Arg Thr Ser Gly Glu Thr Arg Leu 
225 230 235 240 

Ser Asn Tyr Leu Leu Trp Gin Thr Thr Asn Cys He Leu Tyr Ser Pro 

245 250 255 

Tyr Ala Leu Trp Pro Glu He Gly Leu Arg His Val Val Trp Ser Val 

260 265 270 

He Asn Phe Gin Arg His Tyr Ser Tyr Leu Glu Lys His Lys Glu Tyr 
275 280 285 

Leu Lys 
290 

<210> 7 
<211> 1000 
<212> DNA 

<213> Hevea brasiliensis 
<400> 7 

cgggttaagt cagtgattta aggaaaatgg aattatacaa cggtgagagg ccaagtgtgt 60 

tcagactttt agagaagtat atgagaaaag ggttatatag catcctaacc cagggtccca 120 

tccctactca tattgccttc atattggatg gaaacaggag gtttgctaag aagcataaac 180 

tgccagaagg aggtggtcat aaggctggat ttttagctct tctgaacgta ctaacttatt 240 

gctatgagtt aggagtgaaa tatgcgacta tctatgcctt tagcatcgat aattttcgaa 300 

ggaaacctca tgaggttcag tacgtaatgg atctaatgct ggagaagatt gaagggatga 360 

tcatggaaga aagtatcatc aatgcatatg atatttgcgt acgttttgtg ggtaacctga 420 

agcttttaag tgagccagtc aagaccgcag cagataagat tatgagggct actgccaaca 4 80 

attccaaatg tgtgcttctc attgctgtat gctatacttc aactgatgag atcgtgcatg 54 0 

ctgttgaaga atcctctgaa ttgaactcca atgaagtttg taacaatcaa gaattggagg 600 

aggcaaatgc aactggaagc agtactgtga ttcaaactga gaacatggag tcgtattctg 660 

gaataaaact tgtagacctt gagaaaaaca cctacataaa tccttatcct gatgttctga 720 

ttcgaacttc tggggagacc cgtctgagca actacttact ttggcagact actaattgca 780 

tactgtattc tccttatgca ctgtggccag agattggtct tcgacacgtg gtgtggtcag 840 

taattaactt ccaacgtcat tattcttact tggagaaaca taaggaatac ttaaaataat 900 

ttgtttctgt tcctagctca tcctgcctta ttcgcgatag ttaagcttaa gcatatcctt 960 

gtggaataaa ctcggacact taattaagcc ggtattttgt 1000 

<210> 8 
<211> 290 
<212> PRT 

<213> Hevea brasiliensis 
<400> 8 

Met Glu Leu Tyr Asn Gly Glu Arg Pro Ser Val Phe Arg Leu Leu Glu 
15 10 15 

Lys Tyr Met Arg Lys Gly Leu Tyr Ser He Leu Thr Gin Gly Pro He 

20 25 30 

Pro Thr His He Ala Phe He Leu Asp Gly Asn Arg Arg Phe Ala Lys 
35 40 45 

Lys His Lys Leu Pro Glu Gly Gly Gly His Lys Ala Gly Phe Leu Ala 
50 55 60 
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Leu Leu Asn Val Leu Thr Tyr Cys Tyr Glu Leu Gly Val Lys Tyr Ala 
65 70 75 80 

Thr lie Tyr Ala Phe Ser He Asp Asn Phe Arg Arg Lys Pro His Glu 

85 90 95 

Val Gin Tyr Val Met Asp Leu Met Leu Glu Lys He Glu Gly Met He 

100 105 110 

Met Glu Glu Ser He He Asn Ala Tyr Asp He Cys Val Arg Phe Val 
115 120 125 

Gly Asn Leu Lys Leu Leu Ser Glu Pro Val Lys Thr Ala Ala Asp Lys 
130 135 140 

He Met Arg Ala Thr Ala Asn Asn Ser Lys Cys Val Leu Leu He Ala 
145 150 155 160 

Val Cys Tyr Thr Ser Thr Asp Glu He Val His Ala Val Glu Glu Ser 

165 170 175 

Ser Glu Leu Asn Ser Asn Glu Val Cys Asn Asn Gin Glu Leu Glu Glu 

180 185 190 

Ala Asn Ala Thr Gly Ser Ser Thr Val He Gin Thr Glu Asn Met Glu 
195 200 205 

Ser Tyr Ser Gly He Lys Leu Val Asp Leu Glu Lys Asn Thr Tyr He 
210 215 220 

Asn Pro Tyr Pro Asp Val Leu He Arg Thr Ser Gly Glu Thr Arg Leu 
225 230 235 240 

Ser Asn Tyr Leu Leu Trp Gin Thr Thr Asn Cys He Leu Tyr Ser Pro 

245 250 255 

Tyr Ala Leu Trp Pro Glu He Gly Leu Arg His Val Val Trp Ser Val 

260 265 270 

He Asn Phe Gin Arg His Tyr Ser Tyr Leu Glu Lys His Lys Glu Tyr 
275 280 285 

Leu Lys 
290 

<210> 9 
<211> 1000 
<212> DNA 

<213> Hevea brasiliensis 
<400> 9 

ccgagtcacg tataggcttc gtgtgaaggt taagtcagtt tagcatcggg atttgggttt 60 

aaggaaaatg gaaatatata cgggtcagag gccaagtgtg tttagaattt ttgggaaata 120 

catgagaaaa gggttatata gcatcctaac ccaaggtccc atccctactc atcttgcctt 180 

cataatggat ggaaaccgga ggtttgctaa gaagcacaaa atgaaagaag cagaaggtta 24 0 

taaggcagga tatttagctc ttctgagaac actaacttat tgctatgagt tgggagtgag 300 

gtatgtaacc atttatgcct ttagcattga taattttcga aggcaacctc gtgaggttca 360 

gtgcgtaatg aatctaatga tggagaagat tgaagagatt atcgtggaag aaagtatcat 420 

gaatgcatat gatgttggcg tacgtattgt gggtaacctg aatcttttag atgagccaat 480 

caggatcgca gcagaaaaga ttatgagggc tactgccaat aattccgggt ttgtgcttct 540 

cattgctgta gcctatagtt caactgatga gatcgggcat gctgttgaag aatcctctaa 600 

agacaaattg aactccaatg aagtttgcaa caatgggatt gaagctgaac aggaatttaa 660 

ggaggcaaac ggaaccggaa acagtgtgat tccagttcag aagacggagt catattctgg 720 

aataaatctt gcagaccttg agaaaaacac ctacgtaaat cctcatcctg atgtcttgat 780 
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tcgaac'ttct gggttgagcc gtctaagtaa ctacctactt tggcagacta gtaattgcat 840 

actgtattct ccttttgcac tgtggccaga gattggtctc aggcacttgg tatggacagt 900 

aatgaacttc caacgtcatc attcttattt ggagaagcat aaggaatatt taaaataatt 960 

tatttttgtt cctaactcat cctgccttat tcgggataga 1000 

<210> 10 
<211> 296 
<212> PRT 

<213> Hevea brasiliensis 
<400> 10 

Met Glu lie Tyr Thr Gly Gin Arg Pro Ser Val Phe Arg lie Phe Gly 
15 10 15 

Lys Tyr Met Arg Lys Gly Leu Tyr Ser lie Leu Thr Gin Gly Pro lie 

20 25 30 

Pro Thr His Leu Ala Phe lie Met Asp Gly Asn Arg Arg Phe Ala Lys 
35 40 45 

Lys His Lys Met Lys Glu Ala Glu Gly Tyr Lys Ala Gly Tyr Leu Ala 
50 55 60 

Leu Leu Arg Thr Leu Thr Tyr Cys Tyr Glu Leu Gly Val Arg Tyr Val 
65 70 75 80 . 

Thr lie Tyr Ala Phe Ser lie Asp Asn Phe Arg Arg Gin Pro Arg Glu 

85 90 95 

Val Gin Cys Val Met Asn Leu Met Met Glu Lys lie Glu Glu lie lie 

100 105 110 

Val Glu Glu Ser He Met Asn Ala Tyr Asp Val Gly Val Arg He Val 
115 120 125 

Gly Asn Leu Asn Leu Leu Asp Glu Pro He Arg He Ala Ala Glu Lys 
130 135 140 

He Met Arg Ala Thr Ala Asn Asn Ser Gly Phe Val Leu Leu lie Ala 
145 . 150 155 . 160 

Val Ala Tyr Ser Ser Thr Asp Glu He Gly His Ala Val Glu Glu Ser 

165 170 175 

Ser Lys Asp Lys Leu Asn Ser Asn Glu Val Cys Asn Asn Gly He Glu 

180 185 190 

Ala Glu Gin Glu Phe Lys Glu Ala Asn Gly Thr Gly Asn Ser Val He 
195 200 205 

Pro Val Gin Lys Thr Glu Ser Tyr Ser Gly He Asn Leu Ala Asp Leu 
210 215 220 

Glu Lys Asn Thr Tyr Val Asn Pro His Pro Asp Val Leu He Arg Thr 
225 230 235 240 

Ser Gly Leu Ser Arg Leu Ser Asn Tyr Leu Leu Trp Gin Thr Ser Asn 

245 250 255 

Cys He Leu Tyr Ser Pro Phe Ala Leu Trp Pro Glu He Gly Leu Arg 

260 265 270 

His Leu Val Trp Thr Val Met Asn Phe Gin Arg His His Ser Tyr Leu 
275 280 285 
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Glu Lys His Lys Glu Tyr Leu Lys 
290 295 



<210> 11 
<211> 1232 
<212> DNA 
<213> Vitis sp 

<400> 11 

gagaaacatt atcctaaccc tagtcctgaa actcctgata atgctctctt ttcgatttcc 60 
aatttcagct gataacgctc gccatacttt caagtccaaa cactcttctt gtacttttcg 120 
aagtaacaga atcgattcat tttcttttcc tccaatctca gttcccagat ttcacaaact 180 
tcgcacagct aaaactgatg tagttgggga agaagaagca agagaagtaa acgagagagc 240 
ggaggaattt ccggacggtc ttcggagaga actgatgccg gaacacgtgg ccgtcattat 300 
ggacgggaac gtgaggtggg cacagaagag ggggttgccg gcggcgtcgg gtcaccaagc 360 
aggtgtgagg tcgttgagag agctggtgga gctctgttgc aaatggggga tcaaagttct 420 
ctcggttttc gcattttcct atgataattg gtctcgttcc gaaggggagg ttggttttct 480 
tatgagcttg atcgaaagag tggtcaaagc tgagctgcca attttgggag ggaaggcatt 540 
cgagtgtcgt gattggggat ttgtcaaagc ttctgagcaa ctgcaactga taattgatgt 600 
agaggagacc actaaggaga actcgcgatt acagttcatt gtggcactta gctatagtgg 660 
gcagtgtgac atactacaag catgcaaaaa cattggtcac aaagtaaagg atggccttat 720 
cgaaccggaa gacatcaaca aaagcctaat tgaacaggag ctacagacaa actgtactga 7 80 
atttcccttc cctgatctac ttatacgaac tagtggcgaa cttagagtca gcaatttcat 840 
gttgtggcaa atagcctaca ctgaactttg cttttttagc acactgtggc ctgattttgg 900 
gaaggatgag tttgtggagg ccttaagttc ttttcagaaa aggcagagac gatatggtgg 960 
gcgaaactga gtttactaat tacatataga tccccaactt ctgctccatt catatggaga 1020 
acttgtatac cattatatga agttaaattc ctgagaattc acttattaca cacagatccc 1080 
caacctatac tccattcata tggaaaactt gtaccattat atgaaactca ttcttcagaa 114 0 
gggaactgat cataccctgc ttccaagttt taagcatgaa gtgccttgcc atttatatac 1200 
atacttttac ttcaaaaaaa aaaaaaaaaa aa 1232 



<210> 12 
<211> 309 
<212> PRT 
<213> Vitis sp 

<400> 12 

Met Leu Ser Phe Arg Phe Pro He Ser Ala Asp Asn Ala Arg His Thr 
15 10 15 

Phe Lys Ser Lys His Ser Ser Cys Thr Phe Arg Ser Asn Arg He Asp 

20 25 30 

Ser Phe Ser Phe Pro Pro He Ser Val Pro Arg Phe His Lys Leu Arg 
35 40 45 

Thr Ala Lys Thr Asp Val Val Gly Glu Glu Glu Ala Arg Glu Val Asn 
50 55 60 

Glu Arg Ala Glu Glu Phe Pro Asp Gly Leu Arg Arg Glu Leu Met Pro 
65 70 75 80 

Glu His Val Ala Val He Met Asp Gly Asn Val Arg Trp Ala Gin Lys 

85 90 95 

Arg Gly Leu Pro Ala Ala Ser Gly His Gin Ala Gly Val Arg Ser Leu 

100 105 110 

Arg Glu Leu Val Glu Leu Cys Cys Lys Trp Gly He Lys Val Leu Ser 
115 120 125 



8 
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Val Phe Ala Phe Ser Tyr Asp Asn Trp Ser Arg Ser Glu Gly Glu Val 
130 135 140 

Gly Phe Leu Met Ser Leu lie Glu Arg Val Val Lys Ala Glu Leu Pro 
145 150 155 160 

He Leu Gly Gly Lys Ala Phe Glu Cys Arg Asp Trp Gly Phe Val Lys 

165 170 175 

Ala Ser Glu Gin Leu Gin Leu He He Asp Val Glu Glu Thr Thr Lys 

180 185 190 

Glu Asn Ser Arg Leu Gin Phe He Val Ala Leu Ser Tyr Ser Gly Gin 
195 200 205 

Cys Asp He Leu Gin Ala Cys Lys Asn He Gly His Lys Val Lys Asp 
210 215 220 

Gly Leu He Glu Pro Glu Asp He Asn Lys Ser Leu He Glu Gin Glu 
225 230 235 240 

Leu Gin Thr Asn Cys Thr Glu Phe Pro Phe Pro Asp Leu Leu He Arg 

245 250 255 

Thr Ser Gly Glu Leu Arg Val Ser Asn Phe Met Leu Trp Gin He Ala 

260 265 270 

Tyr Thr Glu Leu Cys Phe Phe Ser Thr Leu Trp Pro Asp Phe Gly Lys 
275 280 285 

Asp Glu Phe Val Glu Ala Leu Ser Ser Phe Gin Lys Arg Gin Arg Arg 
290 295 300 

Tyr Gly Gly Arg Asn 
305 



<210> 13 
<211> 1021 
<212> DNA 

<213> Oryza sativa 



60 



<400> 13 

acgcacgagc ttacacgcaa atgcattgta gctgtcctct cgtatggccc aatgcctaag 

catattgcat ttattatgga tggtaaccgt agatatgcta aattcaggag tatccaggaa 120 

ggctctggtc acagggtggg cttctctgct ctcattgcca gcctgctcta ctgctatgaa 180 

atgggcgtga agtatatcac ggtgtatgca tttagcatcg ataattttaa gcgagatccg 240 

actgaggtga aatccttgat ggagttaatg gaggaaaaga tcaatgaact gctagaaaac 300 

agaaatgtca tcaacaaggt taactgtaag atcaacttct gggggaactt ggacatgttg 360 

agcaaatcag tgagggtagc agctgagaaa ctgatggcta ccactgctga aaacacggga 420 

ctggtcttct ctgtttgcat gccatacaac tccacttctg agattgtcaa tgcggtcaat 480 

aaggtctgtg cagaaaggag ggatatactg cagagggagg atgctgacag tgttgcgaat 540 

aatggtgtgt attcagacat ttcagtggca gatctggacc gccatatgta cagcgctggt 600 

tgccccgatc ctgacattgt gatccggacc tcaggtgaga ctcgcctgag caatttcctt 660 

ctgtggcaga cgacgttcag tcatttgcag aatccagacc ctctttggcc ggagttctct 720 

ttcaagcacc ttgtctgggc catactccag taccaaagag ttcacccttc tattgagcaa 780 

agcagaaatc tggctaagaa gcagctgtaa tcacatcctc cctgggagga gatagaaacc 840 
atcatacaag atatctgtag ttacacaata atctgtattc tcctgtggta tctcctggaa 
tatgaaatat ataaaggata gctatgccat tgtatgcttg aacatgtgta tgcttgagtt 

ggtccaaatg tgtgaaatgt aataacattt ggtctaaaaa aaaaaaaaaa aaaaaaaaaa 1020 

1021 



900 
960 
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<210>/ 14 
<211> 252 
<212> PRT 

<213> Oryza sativa 
<400> 14 

Met Pro Lys His lie Ala Phe He Met Asp Gly Asn Arg Arg Tyr Ala 
15 10 15 

Lys Phe Arg Ser He Gin Glu Gly Ser Gly His Arg Val Gly Phe Ser 

20 25 30 

Ala Leu He Ala Ser Leu Leu Tyr Cys Tyr Glu Met Gly Val Lys Tyr 
35 40 45 

He Thr Val Tyr Ala Phe Ser He Asp Asn Phe Lys Arg Asp Pro Thr 
50 55 60 

Glu Val Lys Ser Leu Met Glu Leu Met Glu Glu Lys He Asn Glu Leu 
65 70 75 80 

Leu Glu Asn Arg Asn Val He Asn Lys Val Asn Cys Lys He Asn Phe 

85 90 95 

Trp Gly Asn Leu Asp Met Leu Ser Lys Ser Val Arg Val Ala Ala Glu 

100 105 110 

Lys Leu Met Ala Thr Thr Ala Glu Asn Thr Gly Leu Val Phe Ser Val 
115 120 125 

Cys Met Pro Tyr Asn Ser Thr Ser Glu He Val Asn Ala Val Asn Lys 
130 135 140 

Val Cys Ala Glu Arg Arg Asp He Leu Gin Arg Glu Asp Ala Asp Ser 
145 150 155 160 

Val Ala Asn Asn Gly Val Tyr Ser Asp He Ser Val Ala Asp Leu Asp 

165 170 175 

Arg His Met Tyr Ser Ala Gly Cys Pro Asp Pro Asp He Val lie Arg 

180 185 190 

Thr Ser Gly Glu Thr Arg Leu Ser Asn Phe Leu Leu Trp Gin Thr Thr 
195 200 205 

Phe Ser His Leu Gin Asn Pro Asp Pro Leu Trp Pro Glu Phe Ser Phe 
210 215 220 

Lys His Leu Val Trp Ala He Leu Gin Tyr Gin Arg Val His Pro Ser 
225 230 235 240 



He Glu Gin Ser Arg Asn Leu Ala Lys Lys Gin Leu 

245 250 



<210> 15 

<211> 900 

<212> DNA 

<213> Oryza sativa 

<400> 15 

atgcttggct cacttatgtc ttacttacct tcagtggatt caaagacgga gaacactgat 60 

gagttaattg cgactggtgt tcttgctagt ctgcagaatt tcatccgcaa atgcattgta 120 

gctgtcctct cgtatggccc aatgcctaag catattgcat ttattatgga tggtaaccgt 180 
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agatatgcta aattcaggag tatccaggaa ggctctggtc acagggtggg cttctctgct 24 0 

ctcattgcca gcctgctcta ctgctatgaa atgggcgtga agtatatcac ggtgtatgca 300 

tttagcatcg ataattttaa gcgagatccg actgaggtga aatccttgat ggagttaatg 360 

gaggaaaaga tcaatgaact gctagaaaac agaaatgtca tcaacaaggt taactgtaag 420 

atcaacttct gggggaactt ggacatgttg agcaaatcag tgagggtagc agctgagaaa 4 80 

ctgatggcta ccactgctga aaacacggga ctggtcttct ctgtttgcat gccatacaac 540 

tccacttctg agattgtcaa tgcggtcaat aaggtctgtg cagaaaggag ggatatactg 600 

cagagggagg atgctgacag tgttgcgaat aatggtgtgt attcagacat ttcagtggca 660 

gatctggacc gccatatgta cagcgctggt tgccccgatc ctgacattgt gatccggacc 720 

tcaggtgaga ctcgcctgag caatttcctt ctgtggcaga cgacgttcag tcatttgcag 780 

aatccagacc ctctttggcc ggagttctct ttcaagcacc ttgtctgggc catactccag 84 0 

taccaaagag ttcacccttc tattgagcaa agcagaaatc tggctaagaa gcagctgtaa 900 



<210> 16 
<211> 299 
<212> PRT 

<213> Oryza sativa 
<400> 16 

Met Leu Gly Ser Leu Met Ser Tyr Leu Pro Ser Val Asp Ser Lys Thr 
15 10 15 

Glu Asn Thr Asp Glu Leu lie Ala Thr Gly Val Leu Ala Ser Leu Gin 

20 25 30 

Asn Phe He Arg Lys Cys He Val Ala Val Leu Ser Tyr Gly Pro Met 
35 40 45 

Pro Lys His He Ala Phe He Met Asp Gly Asn Arg Arg Tyr Ala Lys 
50 55 60 

Phe Arg Ser He Gin Glu Gly Ser Gly His Arg Val Gly Phe Ser Ala 
65 70 75 80 

Leu He Ala Ser Leu Leu Tyr Cys Tyr Glu Met Gly Val Lys Tyr He 

85 90 95 

Thr Val Tyr Ala Phe Ser He Asp Asn Phe Lys Arg Asp Pro Thr Glu 

100 105 110 

Val Lys Ser Leu Met Glu Leu Met Glu Glu Lys He Asn Glu Leu Leu 
115 120 125 

Glu Asn Arg Asn Val He Asn Lys Val Asn Cys Lys He Asn Phe Trp 
130 135 140 

Gly Asn Leu Asp Met Leu Ser Lys Ser Val Arg Val Ala Ala Glu Lys 
145 150 155 160 

Leu Met Ala Thr Thr Ala Glu Asn Thr Gly Leu Val Phe Ser Val Cys 

165 170 175 

Met Pro Tyr Asn Ser Thr Ser Glu He Val Asn Ala Val Asn Lys Val 

180 185 190 

Cys Ala Glu Arg Arg Asp He Leu Gin Arg Glu Asp Ala Asp Ser Val 
195 200 205 

Ala Asn Asn Gly Val Tyr Ser Asp He Ser Val Ala Asp Leu Asp Arg 
210 215 220 

His Met Tyr Ser Ala Gly Cys Pro Asp Pro Asp He Val He Arg Thr 
225 230 235 240 
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Ser Gly Glu Thr Arg 

245 

Ser His Leu Gin Asn 

260 

His Leu Val Trp Ala 
275 

Glu Gin Ser Arg Asn 
290 



Leu Ser Asn Phe Leu Leu 

250 

Pro Asp Pro Leu Trp Pro 

265 

He Leu Gin Tyr Gin Arg 
280 

Leu Ala Lys Lys Gin Leu 
295 
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Trp Gin Thr Thr Phe 

255 

Glu Phe Ser Phe Lys 
270 

Val His Pro Ser He 
285 



<210> 17 

<211> 1028 

<212> DNA 

<213> Glycine max 

<400> 17 

ttcccactca gtggtgaatt tgccaaaccg 
tgttctcgtt aagactccct attcctctcg 
attctcacta ttatcactat cgttatcgtt 
cccaaacaca gagtcttatc gtctcgaagc 
atagcgtgac acttcgtgat gacggagtct 
cggcggaact cgcggcggag atgatgccga 
ggaggtgggc gaaggtgaag gggctgccac 
cgctgaggaa aatggtgagg ctgtgttgca 
cgttctctac ggataactgg gttcgcccca 
ttgagagaac aataaactct gaagttcaaa 
tgattggaga ttcatcaagg ttgcctgagt 
aggatacaaa acaaaattcg agattccaac 
atgatgttgt gcaagcatgt aaaagtgtag 
tggatgacat aaacgaaaac attattgaac 
cttatcctga tctactaata cgaactagtg 
ggcaattagc ctacacagaa ctttatttta 
atgagtttgt agatgcatta agttcatttc 
attcataa 



ggataaccgt atccctattc aggaatacaa 60 

ttaaaacacc accttctccc tcttgttatt 120 

atcgttgtta tcatcctttc catcaccgtt 180 

gcggttccgc cattgcgaag tgtcacgctg 240 

cgctcgccca agagtcgttg gagccacttc 300 

agcatgtggc ggtgataatg gacgggaacg 360 

catcggcggg gcaccaggcg ggggtgcaat 4 20 

gctggggaat taaggttcta acggttttcg 480 

aggtggaggt tgatttcttg atgaggctgt 540 

cttttaagag ggaaggaatt agaatatctg 600 

ctttaaaaag aatgatagct agtgcagaag 660 

ttattgtggc agtgggatac agtggaaaat 720 

ccaagaaagt caaagatggt cacattcact 780 

aagaattgga aactaattgt actgagtttc 84 0 

gcgagcttag agtgagtaac ttcttgttgt 900 

atcgggaact ctggccagat tttgggaagg 960 

aacaaagaca aagacgctat ggtggtcgac 1020 

1028 



<210> 18 

<211> 322 

<212> PRT 

<213> Glycine max 

<400> 18 

Met Phe Ser Leu Arg Leu Pro He Pro Leu Val Lys Thr Pro Pro Ser 
15 10 15 

Pro Ser Cys Tyr Tyr Ser His Tyr Tyr His Tyr Arg Tyr Arg Tyr Arg 

20 25 30 

Cys Tyr His Pro Phe His His Arg Ser Gin Thr Gin Ser Leu He Val 
35 40 45 

Ser Lys Arg Gly Ser Ala He Ala Lys Cys His Ala Asp Ser Val Thr 
50 55 60 

Leu Arg Asp Asp Gly Val Ser Leu Ala Gin Glu Ser Leu Glu Pro Leu 
65 70 75 80 

Pro Ala Glu Leu Ala Ala Glu Met Met Pro Lys His Val Ala Val He 

85 90 95 



12 



WO 01/21650 



PCT/USOO/25856 



Met Asp Gly Asn Gly Arg Trp Ala Lys Val Lys Gly Leu Pro Pro Ser 

100 105 110 



Ala Gly His Gin 
115 

Cys Cys Ser Trp 
130 



Ala Gly Val Gin 

120 

Gly lie Lys Val 
135 



Ser Leu Arg Lys 

Leu Thr Val Phe 

140 



Met Val Arg Leu 
125 

Ala Phe Ser Thr 



Asp Asn Trp Val Arg Pro Lys Val Glu Val Asp Phe Leu Met Arg Leu 
145 150 155 160 

Phe Glu Arg Thr lie Asn Ser Glu Val Gin Thr Phe Lys Arg Glu Gly 

165 170 175 



lie Arg He Ser 

180 

Lys Arg Met He 
195 

Phe Gin Leu He 
210 

Gin Ala Cys Lys 
225 

Leu Asp Asp He 



Val He Gly Asp 



Ala Ser Ala Glu 

200 

Val Ala Val Gly 
215 

Ser Val Ala Lys 
230 

Asn Glu Asn He 
245 



Ser Ser Arg Leu 
185 

Glu Asp Thr Lys 



Tyr Ser Gly Lys 

220 

Lys Val Lys Asp 
235 

lie Glu Gin Glu 
250 



Pro Glu Ser Leu 
190 

Gin Asn Ser Arg 
205 

Tyr Asp Val Val 



Gly His He His 

240 

Leu Glu Thr Asn 
255 



Cys Thr Glu Phe Pro Tyr Pro Asp 

260 

Leu Arg Val Ser Asn Phe Leu Leu 
275 280 

Tyr Phe Asn Arg Glu Leu Trp Pro 
290 295 



Leu Leu He Arg Thr Ser Gly Glu 
265 270 

Trp Gin Leu Ala Tyr Thr Glu Leu 

285 

Asp Phe Gly Lys Asp Glu Phe Val 

300 



Asp Ala Leu Ser Ser Phe Gin Gin Arg Gin Arg Arg Tyr Gly Gly Arg 
305 310 315 320 

His Ser 

<210> 19 
<211> 1026 
<212> DNA 

<213> Triticum aestivum 



<400> 19 

atgccgctct ccaactctac gtcgtctgtg ccggccgtca ccgtcccggc ggccgaggag 60 

ctcctctcac aagggctccg ggcggagtcg ctgccgcggc acgtggcgct ggtgatggac 120 

gggaactcgc ggtgggcggc agcgcggggc ctgccgccga cggacgggca cgagcacggg 180 

atgcgcgcgc tgatgaggac ggtgcggctc tcccgcgcct ggggcatccg cgtcctcacc 240 

gccttcggtt tctcgctcga gaactggaat cgccccaagg cggaggttga cttcttgatg 300 

gccttgatcg agaggtttat caacgacaac ctcgccgagt tcttgaggga agggacccgt 360 

ctacgtataa tcggtgaccg ctcaaggctg ccgatctctg tgcagaagac tgcacgagac 420 

gccgaggagg caacaagaaa caactcgcag ctcgatctag tcctagccat cagctacagc 480 

gggcgaatgg acattgtgca ggcatgccgg aatctcgccc agaaagtgga cgccaagctg 540 

ctcaggccgg aggacatcga cgagtcgctg ttcgccgacg agctccagac gagcgaaaca 600 

tcttgcccgg acctgctcat caggaccagc ggcgagctga ggctgagcaa cttcctgcta 660 

tggcagtcgg cttactcgga gctcttcttc accgacacgc tctggcctga tttcggggag 720 

gcccaatatc tccaagccat gatggccttc cagagcagag acaggcgctt tggaagaaga 780 

aaaaacaatg cagcgctata aataaacggt gcacgcgcgt gacccgatgc tcgatcatcc 84 0 
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tctatctatc tgtatctgcc tttataatca gtttttatta ccttcaaata aagtgtttct 900 

ctcaagatgc gtggtgtact ataggagagg ctactaaaac ttctctccag tgattttact 960 

ctatgctata tgctcattgt atttgatata gtttagcatt catgccgaaa aaaaaaaaaa 1020 

aaaaaa 1026 

<210> 20 
<211> 266 
<212> PRT 

<213> Triticum aestivum 
<400> 20 

Met Pro Leu Ser Asn Ser Thr Ser Ser Val Pro Ala Val Thr Val Pro 
15 10 15 

Ala Ala Glu Glu Leu Leu Ser Gin Gly Leu Arg Ala Glu Ser Leu Pro 

20 25 30 

Arg His Val Ala Leu Val Met Asp Gly Asn Ser Arg Trp Ala Ala Ala 
35 40 45 

Arg Gly Leu Pro Pro Thr Asp Gly His Glu His Gly Met Arg Ala Leu 
50 55 60 

Met Arg Thr Val Arg Leu Ser Arg Ala Trp Gly He Arg Val Leu Thr 
65 70 75 80 

Ala Phe Gly Phe Ser Leu Glu Asn Trp Asn Arg Pro Lys Ala Glu Val 

85 90 95 

Asp Phe Leu Met Ala Leu He Glu Arg Phe He Asn Asp Asn Leu Ala 

100 105 110 

Glu Phe Leu Arg Glu Gly Thr Arg Leu Arg He He Gly Asp Arg Ser 
115 120 125 

Arg Leu Pro He Ser Val Gin Lys Thr Ala Arg Asp Ala Glu Glu Ala 
130 135 140 

Thr Arg Asn Asn Ser Gin Leu Asp Leu Val Leu Ala He Ser Tyr Ser 
145 150 155 160 

Gly Arg Met Asp He Val Gin Ala Cys Arg Asn Leu Ala Gin Lys Val 

165 170 175 

Asp Ala Lys Leu Leu Arg Pro Glu Asp He Asp Glu Ser Leu Phe Ala 

180 185 190 

Asp Glu Leu Gin Thr Ser Glu Thr Ser Cys Pro Asp Leu Leu He Arg 
195 200 205 

Thr Ser Gly Glu Leu Arg Leu Ser Asn Phe Leu Leu Trp Gin Ser Ala 
210 215 220 

Tyr Ser Glu Leu Phe Phe Thr Asp Thr Leu Trp Pro Asp Phe Gly Glu 
225 230 235 240 

Ala Gin Tyr Leu Gin Ala Met Met Ala Phe Gin Ser Arg Asp Arg Arg 

245 250 255 

Phe Gly Arg Arg Lys Asn Asn Ala Ala Leu 

260 265 

<210> 21 
<211> 11 
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<212> , PRT 

<213>' Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Domain I of 
published alignment 



<220> 

<221> UNSURE 

<222> (2) . . (3) 

<223> X = any amino acid 

<220> 

<221> UNSURE 
<222> (8) 

<223> X = any amino acid 
<220> 

<221> UNSURE 
<222> { 10) 

<223> X = any amino acid 



<300> 

<301> Apfel, C. M. 

<302> Use of Genomincs to Indentify Bacterial Undecaprenyl 
Pyrophosphate Synthetase: Clooning, Expression, and 
Characterization of the Essential uppS Gene 

<303> J. Bacterid . 

<304> 81 

<306> 483-492 

<307> 1999 



<400> 21 

His Xaa Xaa Met Asp Gly Asn Xaa Arg Xaa Ala 
15 10 

<210> 22 
<211> 24 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Domain V of published 
alignment 

<220> 

<221> UNSURE 
<222> (3) 

<223> X = any amino acid 



<220> 

<221> UNSURE 
<222> (7) 

<223> X = any amino acid 
<220> 

<221> UNSURE 
<222> (10) 

<223> X = any amino acid 
<220> 

<221> UNSURE 
<222> (12) 

<223> X = any amino acid 
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<220> 

<221> UNSURE 

<222> (20) . . (21) 

<223> X = any amino acid 

<400> 22 

Asp Leu Xaa lie Arg Thr Xaa Gly Glu Xaa Arg Xaa Ser Asn Phe Leu 
15 10 15 

Leu Trp Gin Xaa Xaa Tyr Xaa Glu 

20 

<210> 23 
<211> 750 
<212> DNA 

<213> Micrococcus luteus 
<300> 

<301> Shimizu, N. 

<302> Molecular Cloning, Expression, and Purification of Undecprenyl 
Diphosphate Synthase: No Sequence Similarity between E- and 
Z-prenyl Diphosphate Synthases 

<303> J- Biol. Chem. 

<304> 273 

<306> 19476-19481 

<307> 1998 

<400> 23 

atgtttccaa ttaagaagcg aaaagcaata aaaaataata acattaatgc ggcacaaatt 60 
ccgaaacata ttgcaatcat tatggacgga aatggccgat gggcaaaaca gaaaaaaatg 120 
ccgcgcataa aaggacatta tgaaggcatg cagaccgtaa agaaaatcac aagatatgct 180 
agtgatttag gtgtaaagta cttaacgctg tacgcatttt caactgaaaa ttggtctcgt 240 
cctaaagatg aggttaatta cttgatgaaa ctaccgggtg attttttaaa cacattttta 300 
ccggaactca ttgaaaaaaa tgttaaagtt gaaacgattg gctttattga tgatttaccg 360 
gaccatacaa aaaaagcagt gttagaagcg aaagagaaaa cgaaacataa tacaggttta 420 
acgctcgtgt ttgcactgaa ttatggtggg cgtaaagaaa ttatttcagc agtgcagtta 4 80 
atcgcagagc gttacaaatc tggtgaaatt tctttagatg aaattagtga aactcatttt 540 
aatgaatatt tatttacagc aaatatgcct gatcctgagt tgttaatcag aacttccggt 600 
gaagaacgtt taagtaactt tttaatttgg caatgttcat atagtgagtt tgtatttata 660 
gatgaattct ggccggattt taatgaagaa agtttagcac aatgtatatc aatatatcag 720 
aatcgtcatc gacgttttgg tggattataa 750 

<210> 24 
<211> 249 
<212> PRT 

<213> Micrococcus luteus 
<400> 24 

Met Phe Pro lie Lys Lys Arg Lys Ala lie Lys Asn Asn Asn lie Asn 
15 10 15 

Ala Ala Gin lie Pro Lys His lie Ala lie lie Met Asp Gly Asn Gly 

20 25 30 

Arg Trp Ala Lys Gin Lys Lys Met Pro Arg lie Lys Gly His Tyr Glu 
35 40 45 

Gly Met Gin Thr Val Lys Lys lie Thr Arg Tyr Ala Ser Asp Leu Gly 
50 55 60 

Val Lys Tyr Leu Thr Leu Tyr Ala Phe Ser Thr Glu Asn Trp Ser Arg 
65 70 75 80 
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Pro Lys Asp Glu Val Asn Tyr Leu Met Lys Leu Pro Gly Asp Phe Leu 

85 90 95 

Asn Thr Phe Leu Pro Glu Leu lie Glu Lys Asn Val Lys Val Glu Thr 

100 105 110 

lie Gly Phe lie Asp Asp Leu Pro Asp His Thr Lys Lys Ala Val Leu 
115 120 125 

Glu Ala Lys Glu Lys Thr Lys His Asn Thr Gly Leu Thr Leu Val Phe 
130 135 140 

Ala Leu Asn Tyr Gly Gly Arg Lys Glu He He Ser Ala Val Gin Leu 
145 150 155 160 

He Ala Glu Arg Tyr Lys Ser Gly Glu He Ser Leu Asp Glu He Ser 

165 170 175 

Glu Thr His Phe Asn Glu Tyr Leu Phe Thr Ala Asn Met Pro Asp Pro 

180 185 190 

Glu Leu Leu He Arg Thr Ser Gly Glu Glu Arg Leu Ser Asn Phe Leu 
195 200 205 

He Trp Gin Cys Ser Tyr Ser Glu Phe Val Phe He Asp Glu Phe Trp 
210 215 220 

Pro Asp Phe Asn Glu Glu Ser Leu Ala Gin Cys He Ser He Tyr Gin 
225 230 235 240 

Asn Arg His Arg Arg Phe Gly Gly Leu 

245 

<210> 25 
<211> 861 
<212> DNA 

<213> Saccharomyces cerevisiae 
<300> 

<308> AB013497 
<400> 25 

atggaaacgg atagtggtat acctggtcat tcatttgtgt taaagtggac aaaaaacatc 60 
ttttcgcgca cattgcgtgc atctaactgt gtacctagac atgttgggtt catcatggat 120 
gggaacagga gattcgctag aaagaaagag atggacgtaa aggagggcca cgaggcagga 180 
tttgttagta tgagtagaat cttagaactg tgttatgaag caggagtcga tacggctacc 24 0 
gtgtttgcct tttcaattga aaatttcaag aggagctcac gggaagttga atcactgatg 300 
actttagcgc gcgaaaggat acgacaaatc acagaacgtg gagagctggc ctgtaagtat 360 
ggggtacgca ttaaaattat cggcgatctc tctttgttgg ataagtctct attagaagat 420 
gttcgggttg ctgtggaaac tacaaagaac aacaaaaggg ccacgttaaa tatctgcttt 4 80 
ccatatacag gcagggaaga aatcttgcat gccatgaaag aaacaattgt tcaacataag 540 
aagggcgccg ctatagacga aagcacgtta gaatcgcatc tctacacggc gggggtaccc 600 
cctttagatt tattgattag gacaagtggc gtttccagat taagtgactt tttgatatgg 660 
caggcatcga gtaagggcgt acgcatcgaa ttgctggatt gtttatggcc agagtttgga 720 
cctatacgga tggcatggat tttattaaaa ttttcgtttc acaaatcctt tttaaacaaa 780 
gagtacagat tagaggaagg tgattatgac gaggaaacca atggggaccc catcgatttg 84 0 
aaagaaaaaa agttgaatta a 861 

<210> 26 
<211> 286 
<212> PRT 

<213> Saccharomyces cerevisiae 
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<400>,- 26 

Met Glu Thr Asp Ser Gly He Pro Gly His Ser Phe Val Leu Lys Trp 
15 10 15 

Thr Lys Asn He Phe Ser Arg Thr Leu Arg Ala Ser Asn Cys Val Pro 

20 25 30 

Arg His Val Gly Phe He Met Asp Gly Asn Arg Arg Phe Ala Arg Lys 
35 40 45 

Lys Glu Met Asp Val Lys Glu Gly His Glu Ala Gly Phe Val Ser Met 
50 55 60 

Ser Arg He Leu Glu Leu Cys Tyr Glu Ala Gly Val Asp Thr Ala Thr 
65 70 75 80 

Val Phe Ala Phe Ser He Glu Asn Phe Lys Arg Ser Ser Arg Glu Val 

85 90 95 

Glu Ser Leu Met Thr Leu Ala Arg Glu Arg He Arg Gin He Thr Glu 

100 105 no 

Arg Gly Glu- Leu Ala Cys Lys Tyr Gly Val Arg He Lys He He Gly 
115 120 ' 125 

Asp Leu Ser Leu Leu Asp Lys Ser Leu Leu Glu Asp Val Arg Val Ala 
130 135 140 

Val Glu Thr Thr Lys Asn Asn Lys Arg Ala Thr Leu Asn He Cys Phe 
145 150 155 160 

Pro Tyr Thr Gly Arg Glu Glu He Leu His Ala Met Lys Glu Thr He 

165 170 175 

Val Gin His Lys Lys Gly Ala Ala He Asp Glu Ser Thr Leu Glu Ser 

180 185 190 

His Leu Tyr Thr Ala Gly Val Pro Pro Leu Asp Leu Leu He Arg Thr 
195 200 205 

Ser Gly Val Ser Arg Leu Ser Asp Phe Leu He Trp Gin Ala Ser Ser 
210 215 220 

Lys Gly Val Arg He Glu Leu Leu Asp Cys Leu Trp Pro Glu Phe Gly 
225 230 235 240 

Pro He Arg Met Ala Trp He Leu Leu Lys Phe Ser Phe His Lys Ser 

245 250 255 

Phe Leu Asn Lys Glu Tyr Arg Leu Glu Glu Gly Asp Tyr Asp Glu Glu 

260 265 270 

Thr Asn Gly Asp Pro He Asp Leu Lys Glu Lys Lys Leu Asn 
275 280 285 

<210> 27 
<211> 1032 
<212> DNA 

<213> Saccharomyces cerevisiae 
<300> 

<308> AB013498 
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<400> 27 

atgaaaatgc 

accaaagaac 

atgtcattaa 

ttaagggtag 

gccaagtcaa 

acactactgt 

attgaaaatt 

aagcttgatg 

ataagaatag 

gtggaagaaa 

tcaagaaatg 

tcaccaagga 

aaatgtgaat 

caagtacatg 

tttgctatgt 

gagaagaatc 

aagaaaacag 

ggagatgaat 



ccagtattat 
agatgtgctt 
gcttgttttc 
ggccagtgcc 
gaaggctacc 
atatctgcaa 
ttaatagacc 
aattcgcaaa 
taggtgatca 
tcacacagga 
atatgttaca 
ttaatataag 
tattaatcag 
aaaatgccac 
acctgatgat 
actcattgtt 
ctatgtcttt 
aa 



tcagattcag 
cgcagtgaaa 
atggttttat 
tgaacatgtc 
agtaaaaaaa 
aagattgggt 
aaaagaagaa 
aagagccaag 
atctttacta 
tggagacgat 
tactattcgt 
aaaatttact 
aacaagtggg 
cattgaattt 
tctcaaatgg 
tgaaaaaata 
gtacaacttt 



tttgtagccc 
agtatatttc 
gtaaatcttc 
tcctttatca 
ggccatgaag 
gtaaaatgtg 
gtagatacgc 
gactataagg 
tctccagaaa 
ttcactttat 
gattcagttg 
aataaaatgt 
cataggaggc 
agtgatacgt 
tccttctttt 
catgaaagcg 
ccaaaccccc 



taaaaaggct 
agagagtatt 
agaatattt t 
tggatggtaa 
ctggtgggtt 
tttccgccta 
taatgaattt 
atcccttata 
tgagaaaaaa 
ttatatgttt 
aagaccattt 
acatgggttt 
tctcagacta 
tgtggccaaa 
ccaccattca 
ttccttcaat 
ccatttcagt 



tttggtagaa 60 

tgcgtgggtt 120 

gataaaagca 180 

ccggagatat 24 0 

aacgttacta 300 

tgcattttct 360 

gtttacggta 420 

cggatctaaa 480 

aattaaaaaa 540 

tccttacact 600 

ggaaaataaa 660 

ccattccaat 720 

tatgctatgg 780 

ttttagcttc 840 

aaaatataat 900 

atttaaaaaa 960 

ttcggttaca 1020 
1032 



<210> 28 
<211> 343 
<212> PRT 

<213> Saccharomyces 



cerevisiae 



<400> 28 

Met Lys Met Pro Ser lie lie Gin He Gin Phe Val Ala Leu Lys Arg 
15 10 15 

Leu Leu Val Glu Thr Lys Glu Gin Met Cys Phe Ala Val Lys Ser He 

20 25 30 

Phe Gin Arg Val Phe Ala Trp Val Met Ser Leu Ser Leu Phe Ser Trp 
35 40 45, 

Phe Tyr Val Asn Leu Gin Asn He Leu He Lys Ala Leu Arg Val Gly 
50 55 60 

Pro Val Pro Glu His Val Ser Phe He Met Asp Gly Asn Arg Arg Tyr 
65 70 75 80 

Ala Lys Ser Arg Arg Leu Pro Val Lys Lys Gly His Glu Ala Gly Gly 

85 90 95 

Leu Thr Leu Leu Thr Leu Leu Tyr He Cys Lys Arg Leu Gly Val Lys 

100 105 110 

Cys Val Ser Ala Tyr Ala Phe Ser He Glu Asn Phe Asn Arg Pro Lys 
115 120 125 

Glu Glu Val Asp Thr Leu Met Asn Leu Phe Thr Val Lys Leu Asp Glu 
130 135 140 

Phe Ala Lys Arg Ala Lys Asp Tyr Lys Asp Pro Leu Tyr Gly Ser Lys 
145 150 155 160 

He Arg He Val Gly Asp Gin Ser Leu Leu Ser Pro Glu Met Arg Lys 

165 170 175 

Lys He Lys Lys Val Glu Glu He Thr Gin Asp Gly Asp Asp Phe Thr 

180 185 190 



Leu Phe He Cys Phe Pro Tyr Thr Ser Arg Asn Asp Met Leu His Thr 
195 200 205 
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lie Arg Asp Ser Val 
210 

Asn He Arg Lys Phe 
225 

Lys Cys Glu Leu Leu 

245 

Tyr Met Leu Trp Gin 

260 

Thr Leu Trp Pro Asn 
275 

Lys Trp Ser Phe Phe 
290 

Ser Leu Phe Glu Lys 
305 

Lys Lys Thr Ala Met 

325 

Val Ser Val Thr Gly 

340 

<210> 29 
<211> 32 
<212> DNA 
<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : primer 
<400> 29 

gctctagaga aggttaagtc agtttagcat eg 32 

<210> 30 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 30 

ggggtacctt attttaaata ttccttatgc ttctcc 36 

<210> 31 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence .-primer 
<400> 31 

gtggatccat gcttggctca cttatg 26 



Glu Asp His Leu Glu Asn Lys Ser Pro Arg He 
215 220 

Thr Asn Lys Met Tyr Met Gly Phe His Ser Asn 
230 235 240 

He Arg Thr Ser Gly His Arg Arg Leu Ser Asp 

250 255 

Val His Glu Asn Ala Thr He Glu Phe Ser Asp 

265 270 

Phe Ser Phe Phe Ala Met Tyr Leu Met He Leu 
280 285 

Ser Thr He Gin Lys Tyr Asn Glu Lys Asn His 
295 300 

He His Glu Ser Val Pro Ser He Phe Lys Lys 
310 315 320 

Ser Leu Tyr Asn Phe Pro Asn Pro Pro He Ser 

330 335 



Asp Glu 
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<210> 32 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 32 

ttgagctcta tctcctccca gggagg 

<210> 33 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 33 

acggatccat gttctcgtta agactcc 

<210> 34 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 34 

tcgagctctt atgaatgtcg accacc 

<210> 35 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 35 

ctagtctaga atctcccctc cgataaccaa aaaatcc 



<210> 36 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 36 

ggggtaccta gggtttaact tagaaactat ttag 



<210> 37 
<211> 1200 
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<212>' DNA 

<213> arabidopsis 

<400> 37 

tatatttgat taaaccagaa agaaagttta aacactaatc cctaatcagc aattttctcc 60 

cttcccctaa aaatcagccg tatcatatgc tcattccatt tgcattcccc acagaaagaa 120 

aagaaaaact tcattctctt gtttatattt cactcgcaac aaaaaaaaca aaaaaaaaca 180 

aagtgtgttc ttaaattatc ttctctgata accaaaaaag ccctattttc cgagatgaat 240 

accctagaag aagtagatga atccactcat atcttcaacg ctttgatgag tctaatgagg 300 

aaatttttgt tcagagttct atgcgtcggt ccaatcccta ctaacatttc attcatcatg 360 

gatggaaacc gcaggttcgc taagaaacac aatcttatag gcctagatgc aggacataga 420 

gctggtttca tatccgtgaa atatattctt caatactgca aagagattgg tgtaccgtac 480 

gtcacactcc acgcgtttgg tatggataat ttcaagagag gacctgaaga agtcaagtgt 540 

gtgatggatc taatgcttga gaaagtcgag ctcgcgatcg atcaagctgt atcagggaat 600 

atgaacggcg tgagaataat ctttgccggt gatttggatt cgttaaacga gcattttaga 660 

gctgcgacaa agaaactgat ggagcttacg gaggagaata gagatctgat tgtggtggtt 720 

tgcgttgctt acagcacaag tctcgagatt gttcacgctg ttcgaaaatc ttgtgttaga 780 

aaatgtacga atggagatga tcttgtactt ttggagttga gtgatgttga agagtgtatg 840 

tatacatcga ttgtgccggt tccggatctt gtgataagaa ccggaggagg agatcggctg 900 

agtaacttca tgacgtggca aacttcgagg tctcttcttc acagaacgga ggctctttgg 960 

ccggagttag ggctctggca tttggtttgg gcaattctta aattccaaag aatgcaagat 1020 

tacttgacga agaagaaaaa gctcgattag atagtttcta aagttaaacc ctgcaggaaa 1080 

gaacttttaa ctctttatta cgtttaattt acgtgtttct atgactggaa acgagaaagc 1140 

tcacaagcaa atctttttta ttatgtattg gatccgtata acaaacacga atatacaaaa 1200 
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